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REGULATION OF HUMAN RECEPTOR TYROSINE PHOSPHATASE 
FIELD OF THE INVENTION 

5 The invention relates to the regulation of human receptor tyrosine phosphatase. 
BACKGROUND OF THE INVENTION 

Phosphorylation of proteins is a fundamental mechanism for regulating diverse 
10 cellular processes. U.S. Patent 5,952,212. While the majority of protein phosphor- " 
ylation occurs at serine and threonine residues, phosphorylation at tyrosine residues- 
is attracting a great deal of interest since the discovery that many oncogene products 
and growth factor receptors possess intrinsic protein tyrosine kinase activity. The 
importance of protein tyrosine phosphorylation in growth factor signal transduction, 
15 cell cycle progression and neoplastic transformation is now well established. Hunter 
et al, Ann. Rev. Biochem. 54, 987-30, 1985; Ullrich etal, Cell 61, 203-12, 1990; 
Nurse, Nature 344, 503-08, 1990; Cantley et al, Cell 64, 281-302, 1991. 

Biochemical studies have shown that phosphorylation on tyrosine residues of a 
20 variety of cellular proteins is a dynamic process involving competing phosphor- 
ylation and dephosphorylation reactions. The regulation of protein , tyrosine 
phosphorylation is mediated by the reciprocal actions of protein tyrosine kinases 
(PTKases or PTKS) and protein tyrosine phosphatases (PTPs). The tyrosine 
phosphorylation reactions are catalyzed by PTKs. Tyrosine phosphorylated proteins 
25 can be specifically dephosphorylated through the action of PTPs. The level of 
protein tyrosine phosphorylation of intracellular substances is determined by the 
balance of PTK and PTP activities. Hunter, Cell 58, 1013-16, 1989. 
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REGULATION OF HUMAN RECEPTOR TYROSINE PHOSPHATASE 
Protein Tyrosine Kinases 

The protein tyrosine kinases (PTKS; ATP;protein-tyrosine O-phosphotransferase, EC 
5 2.7.1.112) are a large family of proteins that includes many growth factor receptors 
and potential oncogenes. Hanks et al, Science 241 , 42-52, 1988. Many PTKs have 
been linked to initial signals required for induction of the cell cycle. Weaver et al, 
Mol Cell Biol 11(9), 4415-22, 1991. PTKs comprise a discrete family of enzymes 
having common ancestry with, but major differences from, serine/threonine-specific 

10 protein kinases. Hanks et al, supra. The mechanisms leading to changes in activity 
of PTKs are best understood in the case of receptor-type PTKs having a trans- 
membrane topology. Ullrich et al, 1990, supra. The binding of specific ligands to 
the extracellular domain of members of receptor-type PTKs is thought to induce their 
oligomerization, leading to an increase in tyrosine kinase activity and activation of 

15 the signal transduction pathways. Ullrich etal, 1990, supra. Deregulation of kinase 
activity through mutation or overexpression is a well-established mechanism for cell 
transformation. Hunter etal, 1985; Ullrich <tf al, 1990. 

Protein Tyrosine Phosphatases 

20 

The protein phosphatases are composed of at least two separate and distinct families: 
the protein serine/threonine phosphatases and the protein tyrosine phosphatases 
(PTPs; protein-tyrosine-phosphate phosphohydrolase, EC 3.13.48. Hunter, 1989. 
The PTPs are a family of proteins that have been classified into two subgroups. The 

25 first subgroup is made up of the low molecular weight, intracellular enzymes that 
contain a single conserved catalytic phosphatase domain. All known intracellular 
type PTPs contain a single conserved catalytic phosphatase domain. Examples of the 
first group of PTPs include (1) placental PTP IB (Charbonneau et al, Proc. Natl ' 
Acad. Sci USA 86, 5252-56, 1989; Chernoff et al, Proc. Natl Acad. Sci. USA 87, 

30 2735-89, 1990, (2) T-cell PTP (Cool et al, Proc. Natl Acad. Sci. USA 86, 5257-61, 
1989, (3) rat brain PTP (Guan et al, Proc. Natl. Acad. Sci. USA 87, 1501-02, 1990, 
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(4) neuronal phosphatase (STEP) (Lombroso et al, Proc. Natl Acad Set USA 88, 
12A2-A6, 1991, and (5) cytoplasmic phosphatases that contain a region of homology 
to cytoskeletal proteins (Gu et al, Proc. Natl Acad Set USA 88, 5867-871, 1991; 
Yang et al, Proc. Natl Acad. Set USA 88, 5949-53, 1991. 

5 

The second subgroup is made up of the high molecular weight, receptor-linked PTPs, 
termed RPTPs. RPTPs consist of (a) an intracellular catalytic region, (b) a single 
transmembrane segment, and (c) a putative ligand-binding extracellular domain. The 
structures and sizes of the putative ligand-binding extracellular "receptor" domains of 
10 RPTPs are quite divergent. In contrast, the intracellular catalytic regions of RPTPs 
are highly homologous. All RPTPs have two tandemly duplicated catalytic phos- 
phatase homology domains, with the prominent exception of an RPTP termed 
HPTPP, which has only one catalytic phosphatase domain. Tsai et ah, J. Biol Chem. 
255,0534-43, 1991. 

15 

Since the initial purification, sequencing, and cloning of a PTP, additional potential 
PTPs have been identified at a rapid pace. The number of different PTPs that have 
been identified is increasing steadily, leading to speculations that this family may be 
as large as the PTK family. Hunter, 1989. 

20 

Conserved amino acid sequences designated "consensus sequences" have been 
identified in the catalytic domains of known PTPs. Rrueger et al, EMBO J. 9, 
3241-52, 1990; Yi et al, Mol Cell Biol 12, 836-46, 1992. Yi et al aligned the 
catalytic phosphatase domain sequences of the following PTPs: LCA, PTP IB, 
25 TCPTP, LAR, DLAR, and HPTPa, HPTPp, and HPTPy. This alignment includes 
the following consensus sequences (Yi et al, supra): 

1. KCXXYWP 

2. HCSXGXGRXG. 

Krueger et al., aligned the catalytic phosphatase domain sequences of 
30 PTP1B, TCPTP, LAR, LCA, HPTPa, p, 8, 6 and £ and DLAR and DPTP. 
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This alignment includes the following consensus sequences (Krueger et aL, 
supra): 

1. KCXXYWP 

2. HCSXGXGRXG. 

5 

It is becoming clear that dephosphorylation of tyrosine residues can by itself function 
as an important regulatory mechanism. Dephosphorylation of a C-teiminal tyrosine 
residue has been shown to activate tyrosine kinase activity in the src family of 
tyrosine kinases. Hunter, Cell 49 y 1-4, 1987. Tyrosine dephosphorylation has been 
10 suggested to be an obligatory step in the mitotic activation of the 
maturation-promoting factor (MPF) kinase. Morla et ah, Cell 58, 193-203, 1989. 
. These observations point out the need in the art for understanding the mechanisms 
that regulate tyrosine phosphatase activity. 

15 Because of the importance of these enzymes in a variety of cellular processes, there 
is a continuing need in the art to identify additional tyrosine phosphatases, which can 
be regulated to provide therapeutic effects. 

BRIEF SUMMARY OF THE INVENTION 

20 

It is an object of the invention to provide reagents and methods of regulating a 
human receptor tyrosine phosphatase. This and other objects of the invention are 
provided by one or more of the embodiments described below. 

25 One embodiment of the invention is a receptor tyrosine phosphatase polypeptide 
comprising an amino acid sequence selected from the group consisting of: 

amino acid sequences which are at least about 87% identical to the amino acid' 
sequence shown in SEQ ID NO: 2; and 

30 

the amino acid sequence shown in SEQ ID NO: 2. 
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Yet another embodiment of the invention is a method of screening for agents which 
decrease extracellular matrix degradation. A test compound is contacted with a 
receptor tyrosine phosphatase polypeptide comprising an amino acid sequence 
5 selected from the group consisting of: 

amino acid sequences which are at least about 87% identical to the amino acid 
sequence shown in SEQ ID NO: 2; and 

10 the amino acid sequence shown in SEQ ID NO: 2. 

' Binding between the test compound and the receptor tyrosine phosphatase 
polypeptide is detected. A test compound which binds to the receptor tyrosine phos- 
phatase polypeptide is thereby identified as a potential agent for decreasing extra- 
15 cellular matrix degradation. The agent can work by decreasing the activity of the 
receptor tyrosine phosphatase. 

Another embodiment of the invention is a method of screening for agents which 
decrease extracellular matrix degradation. A test compound is contacted with a poly- 
20 nucleotide encoding a receptor tyrosine phosphatase polypeptide, wherein the poly- 
nucleotide comprises a nucleotide sequence selected from the group consisting of: 

nucleotide sequences which are at least about 50% identical to the nucleotide 
sequence shown in SEQ ID NO: 1 ; and 

25 

the nucleotide sequence shown in SEQ ID NO: 1 . 

Binding of the test compound to the polynucleotide is detected. A test compound - 
which binds to the polynucleotide is identified as a potential agent for decreasing 
30 extracellular matrix degradation. The agent can work by decreasing the amount of the 
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receptor tyrosine phosphatase through interacting with the receptor tyrosine phos- 
phatase raJRNA. 

Another embodiment of the invention is a method of screening for agents which 
5 regulate extracellular matrix degradation- A test compound is contacted with a 
receptor tyrosine phosphatase polypeptide comprising an amino acid sequence 
selected from the group consisting of: 

amino acid sequences which are at least about 87% identical to the amino acid 
10 sequence shown in SEQ ID NO: 2; and 

the amino acid sequence shown in SEQ ID NO: 2. 

A receptor tyrosine phosphatase activity of the polypeptide is detected. A test com- 
15 pound which increases receptor tyrosine phosphatase activity of the polypeptide 
relative to receptor tyrosine phosphatase activity in the absence of the test compound 
is thereby identified as a potential agent for increasing extracellular matrix 
degradation. A test compound which decreases receptor tyrosine phosphatase activity 
of the polypeptide relative to receptor tyrosine phosphatase activity in the absence of 
20 the test compound is thereby identified as a potential agent for decreasing extra- 
cellular matrix degradation. 

Even another embodiment of the invention is a method of screening for agents which 
decrease extracellular matrix degraidation. A test compound is contacted with a 
25 receptor tyrosine phosphatase product of a polynucleotide which comprises a- 
nucleotide sequence selected from the group consisting of: 

nucleotide sequences which are at least about 50% identical to the nucleotide* 
sequence shown in SEQ ID NO: 1 ; and 

30 

the nucleotide sequence shown in SEQ ED NO: 1 . 
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Binding of the test compound to the receptor tyrosine phosphatase product is 
detected. A test compound which binds to the receptor tyrosine phosphatase product 
is thereby identified as a potential agent for decreasing extracellular matrix 
5 degradation. 

Still another embodiment of the invention is a method of reducing extracellular 
matrix degradation. A cell is contacted with a reagent which specifically binds- to a 
polynucleotide encoding a receptor tyrosine phosphatase polypeptide or the product 
10 encoded by the polynucleotide, wherein the polynucleotide comprises a nucleotide 
sequence selected from the group consisting of: 

nucleotide sequences which are at least about 50% identical to the nucleotide 
sequence shown in SEQ ID NO: 1; and 

15 

the nucleotide sequence shown in SEQ ID NO: 1. 

Receptor tyrosine phosphatase activity in the cell is thereby decreased. 

20 The invention thus provides a human receptor tyrosine phosphatase that can be used 
to identify test compounds that may act, for example, as activators or inhibitors at the 
enzyme's active site. Human receptor tyrosine phosphatase and fragments thereof 
also are useful in raising specific antibodies that can block the enzyme and 
effectively reduce its activity. 

25 

BRIEF DESCRIPTION OF THE FIGURES 

Fig. 1 shows the DNA-sequence encoding a receptor tyrosine 
phosphatase Polypeptide (SEQ ID NO:l). 
30 Fig. 2 shows the amino acid sequence deduced from the DNA- 
sequence of Fig. 1 (SEQIDNO:2). 



WO 03/033688 PCT/EP02/11473 



Fig. 3 shows the amino acid sequence of the protein identified 
by trembl|AF063249|AF063249_J (SEQ ID NO:3). 

Fig. 4 shows the amino acid sequence of a receptor tyrosine 
phosphatase Polypeptide (SEQ ID NO:4). 
5 Fig. 5 shows the amino acid sequence of a receptor tyrosine 
phosphatase Polypeptide (SEQ ID NO:5). 

Fig. 6 shows the DNA-sequence encoding a receptor tyrosine 
phosphatase Polypeptide (SEQ ID NO:6). 

Fig. 7 shows the DNA-sequence encoding a receptor tyrosine 
10 phosphatase Polypeptide (SEQ ID NO:7): 

Fig. 8 shows the DNA-sequence encoding a receptor tyrosine 
phosphatase Polypeptide (SEQ ID NO: 8). 

Fig. 9 shows the BLASTP - alignment of 492 JProtein (SEQ 
ID NO:2) against trembl|AF063249|AF063249_l (SEQ 
15 ID NO:3). This hit is scoring at: 0.0 (expectation 

value); Alignment length (overlap): 2301 ; Identities : 
86 % ; Scoring matrix : BLOSUM62 (used to infer 
consensus pattern) ; Database searched : nrdb_l_. 

Fig. 10 shows the BLASTP - alignment of 492_Protein (SEQ 
20 ID NO:2) against pdb|lLAR|lLAR-A larfragment: 

phosphatase. This hit is scoring at: 3e-60 (expectation 
value); Alignment length (overlap): 308; Identities: 40 
% ; Scoring matrix : BLOSUM62 (used to infer 
consensus pattern) ; Database searched : nrdb_l_. 
25 Fig. 11 shows the BLASTP - alternative alignment of 
492J>rotein (SEQ ID NO:2) against pdb|lLAR|lLAR-. 
A lar fragment: phosphatase. This hit is scoring at: 3e- 
44 (expectation value); Alignment length (overlap): 245; 
Identities: 40 %; Scoring matrix: BLOSUM62 (used to 
30 . infer consensus pattern); Database searched: nrdb_l_. 
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Fig. 12 shows the HMMPFAM - alignment of 492JProtein 
(SBQ ID NO:2) against pfam|hmm|Yjphosphatase 
Protein-tyrosine phosphatase. This hit is scoring at: 
374.0; Scoring matrix: BLOSUM62 (used to infer 
5 consensus pattern). 

Fig. 13 shows the ATP_GTP_A region, transmembrane 
segments, and fibronectin type m domain region 
prediction. ATP_GTP_A region from residue 1621 to 
1629. Source: [prosite]; Tyrosine specific protein 

10 phosphatases signature; -Consensus pattern: [LIVMF]- 

H-C-x(2)-G-x(3)-[STC]-[STAGP]-x-|XIVMFY] [C is 
the active site residue]; ATP/GTP-binding site motif A 
(P-loop); -Consensus pattern: [AG]-x(4)-G-K-[ST]; 
High scoring transmembrane segments: From 1906 to 

15 1928: length= 23, score=69.00 ** 1906 HLSVTLCIL 

SELLGTAIF AFAL: 5(21.7%); A: 3(13.0%); I: 
6(26. 1 %); Fibronectin type in domain region. 
Fig. 14 shows the Genewise output: use rat "glomerular" 
mesangial cell receptor protein-tyrosine phosphatase 

20 precursor" (AF063249) (SEQ.ID NO:3) as a template on 

human genomic DNA (NT_0 1 9546; 
chrl2:8890000188900001-32923-88900001269821); 
Score 3940.84 bits over entire alignment; Scores as bits 
over a synchronous coding model. 

25 Fig. 15 shows the Genewise output: use partial protein 
(492JBAC_p, below) of rat "glomerular mesangial cell 
receptor protein-tyrosine phosphatase precursor" 
(AF063249) (SEQ ID NO:3) as a template on human 
12qBACRPll-288D9(gi|15011617|gb|AC083812.24). 



30 
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DETAILED DESCRIPTION OF THE INVENTION 

The invention relates to an isolated polynucleotide from the group consisting of: 

5 a) a polynucleotide encoding a receptor tyrosine phosphatase 

polypeptide comprising an amino acid sequence selected from the 
group consisting of: 

amino acid sequences which are at least about 87% identical to 
the amino acid sequence shown in SEQ ID NO: 2; and 
10 the amino acid sequence shown in SEQ ID NO: 2. 

b) a polynucleotide comprising the sequence of SEQ ID NO: 1 ; 

c) a polynucleotide which hybridizes under stringent conditions to a 
polynucleotide specified in (a) and (b) and encodes a receptor tyrosine 
phosphatase polypeptide; 

15 d) a polynucleotide the sequence of which deviates from the 

polynucleotide sequences specified in (a) to (c) due to the 
degeneration of the genetic code and encodes a receptor tyrosine 
phosphatase polypeptide; and 
e) a polynucleotide which represents a fragment, derivative or allelic 

20 variation of a polynucleotide sequence specified in (a) to (d) and 

encodes a receptor tyrosine phosphatase polypeptide. 

Furthermore, it has been discovered by the present applicant that a novel receptor 
tyrosine phosphatase, particularly a human receptor tyrosine phosphatase, can be 

25 used in therapeutic methods to treat diabetes, a CNS disorder, COPD or a 
cardiovascular disorder. Human receptor tyrosine phosphatase comprises the amino 
acid sequence shown in SEQ ID NO:2. A coding sequence for human receptor 
tyrosine phosphatase is shown in SEQ ID NO:l. This sequence is located on' 
chromosome 12. Related ESTs (AF169351; AR073855; AR073854; AR073853; 

30 AR073852; AR073851; AR073850) are expressed in colon and K562 CML cell line. 
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Human receptor tyrosine phosphatase is 86% identical over 2301 amino acids to 
glomerular mesangial cell receptor protein-tyrosine phosphatase precursor (FIG. 1). 
Human receptor tyrosine phosphatase of the invention is expected to be useful for the 
same purposes as previously identified receptor tyrosine phosphatase enzymes. 

5 

Human receptor tyrosine phosphatase is believed to be useful in therapeutic methods 
to treat disorders such as diabetes, obesity, a CNS disorder, COPD, and 
cardiovascular disorders. Human receptor tyrosine phosphatase also can be used to 
screen for human receptor tyrosine phosphatase activators and inhibitors. 

10 

Polypeptides 

Human receptor tyrosine phosphatase polypeptides according to the invention 
comprise at least 6, 10, 15, 20, 25, 50, 75, 100, 125, 150, 175, 200, 225, 250, 275, 

15 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, 600, 625, 650, 675, 700, 
725, 750, 775, 800, 825, 850, 875, 900, 925, 950, 975, 1000, 1025, 1050, 1075, 1100, 
1125, 1150, 1175, 1200, 1225, 1250, 1275, 1300, 1325, 1350, 1375, 1400, 1425, 
1450, 1475, 1500, 1525, 1550, 1575, 1600, 1625, 1650, 1675, 1700, 1725, 1750, 
1775, 1800, 1825, 1850, 1875, 1900, 1925, 1950, 1975, 2000, 2025, 2050, 2075, 

20 ' 2100, 2125, 2150, 2175, 2200, 2225, 2250, 2275 or 2299 contiguous amino acids 
selected from the amino acid sequence shown in SEQ ID NO:2 or a biologically 
active variant thereof, as defined below. A receptor tyrosine phosphatase 
polypeptide of the invention therefore can be a portion of a receptor tyrosine 
phosphatase protein, a full-length receptor tyrosine phosphatase protein, or a fusion 

25 protein comprising all or a portion of a receptor tyrosine phosphatase protein. 

Biologically active variants 

Human receptor tyrosine phosphatase polypeptide variants that are biologically 
30 active, e.g., retain an enzymatic activity, also are receptor tyrosine phosphatase 
polypeptides. Preferably, naturally or non-naturally occurring receptor tyrosine 
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phosphatase polypeptide variants have amino acid sequences which are at least about 
87, preferably about 90, 96, 96, 98, or 99% identical to the amino acid sequence 
shown in SEQ ED NO:2 or a fragment thereof. Percent identity between a putative 
receptor tyrosine phosphatase polypeptide variant and an amino acid sequence of 
5 SEQ ID NO:2 is determined using the Blast2 alignment program (Blosum62, Expect 
10, standard genetic codes). . 

Variations in percent identity can be due, for example, to amino acid substitutions, 
insertions, or deletions. Amino acid substitutions are defined as one for one amino 
10 acid replacements. They are conservative in nature when the substituted amino acid 
has similar structural and/or chemical properties. Examples of conservative 
replacements are substitution of a leucine with an isoieucine or valine, an aspartate 
with a glutamate, or a threonine with a serine. 

15 Amino acid insertions or deletions are changes to or within an amino acid sequence. 
They typically fall in the range of about 1 to 5 amino acids. Guidance in determining 
which amino acid residues can be substituted, inserted, or deleted without abolishing 
biological or immunological activity of a human receptor tyrosine phosphatase 
polypeptide can be found using computer programs well known in the art, such as 

20 DNASTAR software. 

The invention additionally, encompasses receptor tyrosine phosphatase polypeptides 
that are differentially modified during or after translation, e.g., by glycosylation, 
acetylation, phosphorylation, amidation, derivatization by known protecting/blocking 

'25 groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, 
etc. Any of numerous chemical modifications can be carried out by known 
techniques including, but not limited, to specific chemical cleavage by cyanogen 
bromide, trypsin; chymotrypsin, papain, V8 protease, NaBHU, acetylation,' 
formylation, oxidation, reduction, metabolic synthesis in the presence of 

30 tunicamycin, etc. 
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Additional post-translational modifications encompassed by the invention include, 
for example, e.g., N-linked or O-linked carbohydrate chains, processing of N- 
tenninal or C-terminal ends), attachment of chemical moieties to the amino acid 
backbone, chemical modifications of N-linked or O-linked carbohydrate chains, and 
5 addition or deletion of an N-terminal methionine residue as a result of prokaryotic 
host cell expression. The receptor tyrosine phosphatase polypeptides may also be 
modified with a detectable label, such as an enzymatic, fluorescent, isotopic or 
affinity label to allow for detection and isolation of the protein. 

10 The invention also provides chemically modified derivatives of receptor tyrosine 
phosphatase polypeptides that may provide additional advantages such as increased 
solubility, stability and circulating time of the polypeptide, or decreased immuno- 
genicity (see U.S. Patent No. 4,179,337). The chemical moieties for derivitization 
can be selected from water soluble polymers such as polyethylene glycol, ethylene 

15 glycol/propylene glycol copolymers, carboxymethylcellulose, dextran, polyvinyl 
. alcohol, and the like. The polypeptides can be modified at random or predetermined 
positions within the molecule and can include one, two, three, or more attached 
chemical moieties. 

20 Whether an amino acid change or a polypeptide modification results in a biologically 
active receptor tyrosine phosphatase polypeptide can readily be determined by 
assaying for enzymatic activity, as described for example, in Calvert-Evers & 
Hammond, Cell Biol Int. 24, 559-68, 2000. 

25 Fusion proteins 

Fusion proteins are useful for generating antibodies against receptor tyrosine 
phosphatase polypeptide amino acid sequences and for use in various assay systems.' 
For example, fusion proteins can be used to identify proteins that interact with 
30 portions of a human receptor tyrosine phosphatase polypeptide. Protein affinity 
chromatography or library-based assays for protein-protein interactions, such as the 
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yeast two-hybrid or phage display systems, can be used for this purpose. Such 
methods are well known in the art and also can be used as drug screens. 

A human receptor tyrosine phosphatase polypeptide fusion protein comprises two 
5 polypeptide segments fused together by means of a peptide bond. The first 
polypeptide segment comprises at least 6, 10, 15, 20, 25, 50, 75, 100, 125, 150, 175, 
200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 525, 550, 575, 600, 
625, 650, 675, 700, 725, 750, 775, 800, 825, 850, 875, 900, 925, 950, 975, 1000, 
1025, 1050, 1075, 1100, 1125, 1150, 1175, 1200, 1225, 1250, 1275, 1300, 1325, 

10 1350, 1375, 1400, 1425, 1450, 1475, 1500, 1525, 1550, 1575, 1600, 1625, 1650, 
1675, 1700, 1725, 1750, 1775, 1800, 1825, 1850, 1875, 1900, 1925, 1950, 1975, 
2000, 2025, 2050, 2075, 2100, 2125, 2150, 2175, 2200, 2225, 2250, 2275 or 2299 
contiguous amino acids of SEQ ID NO:2 or of a biologically active variant, such as 
those described above. The first polypeptide segment also can comprise full-length 

15 receptor tyrosine phosphatase protein. 

The second polypeptide segment can be a full-length protein or a protein fragment 
Proteins commonly used in fusion protein construction include p-galactosidase, p- 
glucuronidase, green fluorescent protein (GFP), autofluorescent proteins, including 

20' blue fluorescent protein (BFP), glutathione-S-transferase (GST), luciferase, 
horseradish peroxidase (HRP), and chloramphenicol acetyltransferase (CAT). 
Additionally, epitope tags are used in fusion protein constructions, including 
histidine (His) tags, FLAG tags, influenza hemagglutinin (HA) tags, Myc tags, VSV- 
G tags, and thioredoxin (Trx) tags. Other fusion constructions can include maltose 

25 binding protein (MBP), S-tag, Lex a DNA binding domain (DBD) fusions, GAL4 
DNA binding domain fusions, and herpes simplex virus (HSV) BP 16 protein fusions. 
A fusion protein also can be engineered to contain a cleavage site located between 
the receptor tyrosine phosphatase polypeptide-encoding sequence and the' 
heterologous protein sequence, so that the receptor tyrosine phosphatase polypeptide 

30 can be cleaved and purified away from the heterologous moiety. 
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A fusion protein can be synthesized chemically, as is known in the art. Preferably, a 
fusion protein is produced by covalently linking two polypeptide segments or by 
standard procedures in the art of molecular biology. Recombinant DNA methods 
can be used to prepare fusion proteins, for example, by making a DNA construct 
5 which comprises coding sequences selected from SEQ ID NO:l in proper reading' 
frame with nucleotides encoding the second polypeptide segment and expressing the 
DNA construct in a host cell, as is known in the art. Many kits for constructing 
fusion proteins are available from companies such as Promega Corporation 
(Madison, WI), Stratagene (La Jolla, CA), CLONTECH (Mountain View, CA), 
10 Santa Cruz Biotechnology (Santa Cruz, CA), MBL International Corporation (MIC; 
Watertown, MA), and Quantum Biotechnologies (Montreal, Canada; 1-888-DNA- 
KITS). 

Identification of species homologs 

15 

Species homologs of human receptor tyrosine phosphatase polypeptide can be 
obtained using receptor tyrosine phosphatase polypeptide polynucleotides (described 
below) to make suitable probes or primers for screening cDNA expression libraries 
from other species, such as mice, monkeys,- or yeast, identifying cDNAs which 
20 encode homologs of receptor tyrosine phosphatase polypeptide, and expressing the 
cDNAs as is known in the art. 

Polynucleotides 

25 A human receptor tyrosine phosphatase polynucleotide can be single- or double- 
stranded and comprises a coding sequence or the complement of a coding sequence 
for a receptor tyrosine phosphatase polypeptide. A coding sequence for human 
receptor tyrosine phosphatase is shown in SEQ ID NO: 1 . 

30 Degenerate nucleotide sequences encoding human receptor tyrosine phosphatase 
polypeptides, as well as homologous nucleotide sequences which are at least about 
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50, 55, 60, 65, 70, preferably about 75, 90, 96, 98, or 99% identical to the nucleotide 
sequence shown in SEQ ID NO:l or its complement also are receptor tyrosine 
phosphatase polynucleotides. Percent sequence identity between the sequences of 
two polynucleotides is determined using computer programs such as ALIGN which 
5 employ the FASTA algorithm, using an affine gap search with a gap open penalty of 
-12 and a gap extension penalty of -2. Complementary DNA (cDNA) molecules, 
species homologs, and variants of receptor tyrosine phosphatase polynucleotides that 
encode biologically active receptor tyrosine phosphatase polypeptides also are 
receptor tyrosine phosphatase polynucleotides. Polynucleotide fragments comprising 
10 at least 8, 9, 10, 1 1, 12, 15, 20, or 25 contiguous nucleotides of SEQ ID NO:l or its 
complement also are receptor tyrosine phosphatase polynucleotides. These frag- 
ments can be used, for example, as hybridization probes or as antisense oligo- 
nucleotides. 

1 5 Identification of polynucleotide variants and homologs 

Variants and homologs of the receptor tyrosine phosphatase polynucleotides 
described above also are receptor tyrosine phosphatase polynucleotides. Typically, 
homologous receptor tyrosine phosphatase polynucleotide sequences can be identi- 

20 fied by hybridization of candidate polynucleotides to known receptor tyrosine phos- 
phatase polynucleotides under stringent conditions, as is known in the art. For 
example, using the following wash conditions~2X SSC (0.3 M NaCl, 0.03 M sodium 
citrate, pH 7.0), 0.1% SDS, room temperature twice, 30 minutes each; then 2X SSC, 
0.1% SDS, 50 °C once, 30 minutes; then 2X SSC, room temperature twice, 10 

25 minutes each homologous sequences can be identified which contain at most about 
25-30% basepair mismatches. More preferably, homologous nucleic acid strands 
contain 15-25% basepair mismatches, even more preferably 5-15% basepair 
mismatches. 

30 Species homologs of the receptor tyrosine phosphatase polynucleotides disclosed 
herein also can be identified by making suitable probes or primers and screening 
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cDNA expression libraries from other species, such as mice, monkeys, or yeast. 
Human variants of receptor tyrosine phosphatase polynucleotides can be identified, 
for example, by screening human cDNA expression libraries. It is well known that 
the T m of a double-stranded DNA decreases by 1-1.5. °C with every 1% decrease in 
5 homology (Bonner et a/., J. Mol Biol 81, 123 (1973). Variants of human receptor 
tyrosine phosphatase polynucleotides or receptor tyrosine phosphatase polynucleo- 
tides of other species can therefore be identified by hybridizing a putative 
homologous receptor tyrosine phosphatase polynucleotide with a polynucleotide 
having a nucleotide sequence of SEQ ID NO:l or the complement thereof to form a 
10 test hybrid. The melting temperature of the test hybrid is compared with the melting 
temperature of a hybrid comprising polynucleotides having perfectly complementary 
nucleotide sequences, and the number or percent of basepair mismatches within the 
test hybrid is calculated. 

15 Nucleotide sequences which hybridize to receptor tyrosine phosphatase polynucleo- 
tides or their complements following stringent hybridization and/or wash conditions 
also are receptor tyrosine phosphatase polynucleotides. Stringent wash conditions 
are well known and understood in the art and are disclosed, for example, in 
Sambrook et al 9 Molecular Cloning: A Laboratory Manual, 2d ed., 1989, at 

20 pages 9.50-9.51. . 

Typically, for stringent hybridization conditions a combination of temperature and 
salt concentration should be chosen that is approximately 12-20 °C below the 
calculated T m of the hybrid under study. The T m of a hybrid between a receptor 
25 tyrosine phosphatase polynucleotide having a nucleotide sequence shown in SEQ ED 
NO:l or the complement thereof and a polynucleotide sequence which is at least 
about 50, preferably about 75, 90, 96, or 98% identical to one of those nucleotide 
sequences can be calculated, for example, using the equation of Bolton and" 
McCarthy, Proc. Natl Acad. Set U.S.A. 48, 1390 (1962): 



30 
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T m = 81.5 °C - 16.6(logi 0 [Na + ]) + 0.41(%G + C) - 
0.63(%formamide) - 600//), where / = the length of the 
hybrid in basepairs. 

5 . Stringent wash conditions include, for example, 4X SSC at 65 °C, or 50% 
formamide, 4X SSC at 42 °C, or 0.5X SSC, 0.1% SDS at 65 °C. Highly stringent 
wash conditions include, for example, 0.2X SSC at 65 °C. 

Preparation of polynucleotides 

10 

A human receptor tyrosine phosphatase polynucleotide can be isolated free of other 
cellular components such as membrane components, proteins, and lipids. Poly- 
nucleotides can be made by a cell and isolated using standard nucleic acid 
purification techniques, or synthesized using an amplification technique, such as the 

15 polymerase chain reaction (PCR), or by using an automatic synthesizer. Methods for 
isolating polynucleotides are routine and are known in the art. Any such technique 
for obtaining a polynucleotide can be used to obtain isolated receptor tyrosine 
phosphatase polynucleotides. For example, restriction enzymes and probes can be 
used to isolate polynucleotide fragments, which comprise receptor tyrosine phos- 

20 phatase nucleotide sequences. Isolated polynucleotides are in preparations that are 
free or at least 70, 80, or 90% free of other molecules. 

Human receptor tyrosine phosphatase cDNA molecules can be made with standard 
molecular biology techniques, using receptor tyrosine phosphatase mRNA as a 
25 template. Human receptor tyrosine phosphatase cDNA molecules can thereafter be 
replicated using molecular biology techniques known in the art and disclosed in 
manuals such as Sambrook et al (1989). An amplification technique, such as PCR, 
can be used to obtain additional copies of polynucleotides of the invention, using' 
either human genomic DNA or cDNA as a template. 



30 
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Alternatively, synthetic chemistry techniques can be used to synthesize receptor 
tyrosine phosphatase polynucleotides. The degeneracy of the genetic code allows 
alternate nucleotide sequences to be synthesized which will encode a human receptor 
tyrosine phosphatase polypeptide having, for example, an amino acid sequence 
5 shown in SEQ ID NO:2 or a biologically active variant thereof. 

Extending polynucleotides 

Both methods are well known in the art. Briefly, filters with bacterial colonies 
1 0 containing the library in pBluescript or bacterial lawns containing lambda plaques are 
denatured, and the DNA is fixed to the filters. The filters are hybridized with the 
labeled probe using hybridization conditions described by Davis el al y 1986. The 
partial sequences, cloned into lambda or pBluescript, can be used as positive controls 
to assess background binding and to adjust the hybridization and washing 
15 stringencies necessary for accurate clone identification. The resulting autoradio- 
grams are compared to duplicate plates of colonies or plaques; each exposed spot 
corresponds to a positive colony or plaque. The colonies or plaques are selected, 
expanded and the DNA is isolated from the colonies for further analysis and 
sequencing. 

20 

Positive cDNA clones are analyzed to determine the amount of additional sequence 
they contain using PCR with one primer from the partial sequence and the other 
primer from the vector. Clones with a larger vector-insert PCR product than the 
original partial sequence are analyzed by restriction digestion and DNA sequencing 
25 to determine whether they contain an insert of the same size or similar as the mRNA 
size determined from Northern blot Analysis. 

Once one or more overlapping cDNA clones are identified, the complete sequence of 
the clones can be determined, for example after exonuclease III digestion 
30 (McCombie et ah, Methods 3, 33-40, 1991). A series of deletion clones are 
generated, each of which is sequenced. The resulting overlapping sequences are 
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assembled into a single contiguous sequence of high redundancy (usually three to 
five overlapping sequences at each nucleotide position), resulting in a highly accurate 
final sequence. 

5 Various PCR-based methods can be used to extend the nucleic acid sequences 
disclosed herein to detect upstream sequences such as promoters and regulatory 
elements. For example, restriction-site PCR uses universal primers to retrieve 
unknown sequence adjacent to a known locus (Sarkar, PCR Methods Applic. 2, 
318-322, 1993). Genomic DNA is first amplified in the presence of a primer to a 
10 linker sequence and a primer specific to the known region. The amplified sequences 
are then subjected to a second round of PCR with the same linker primer and another 
specific primer internal to the first one. Products of each round of PCR are 
transcribed with an appropriate RNA polymerase and sequenced using reverse 
transcriptase. 

15 

Inverse PCR also can be used to amplify or extend sequences using divergent primers 
based on a known region (Triglia et aL, Nucleic Acids Res. 16, 8186, 1988). Primers 
can be designed using commercially available software, such as OLIGO 4.06 Primer 
Analysis software (National Biosciences Inc., Plymouth, Minn.), to be 22-30 

20 nucleotides in length, to have a GC content of 50% or more, and to anneal to the 
target sequence at temperatures about 68-72 °C. The method uses several restriction 
enzymes to generate a suitable fragment in the known region of a gene. The 
fragment is then circularized by intramolecular ligation and used as a PCR template. 
Another method which can be used is capture PCR, which involves PCR 

25 amplification of DNA fragments adjacent to a known sequence in human and yeast 
artificial chromosome DNA (Lagerstrom et al 9 PCR Methods Applic. 7, 111-119, 
1991). In this method, multiple restriction enzyme digestions and ligations also can 
be used to place an engineered double-stranded sequence into an unknown fragment " 
of the DNA molecule before performing PCR. 

30 



WO 03/033688 



PCT/EP02/11473 



-21- 

Another method which can be used to retrieve unknown sequences is that of Parker 
et al, Nucleic Acids Res. 19, 3055-3060, 1991). Additionally, PCR, nested primers, 
and PROMOTERFINDER libraries (CLONTECH, Palo Alto, Calif.) can be used to 
walk genomic DNA (CLONTECH, Palo Alto, Calif.). This process avoids the need 
5 to screen libraries and is useful in finding intron/exon junctions. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. Randomly-primed libraries are preferable, in 
that they will contain more sequences which contain the 5' regions of genes. Use of 
10 a randomly primed library may be especially preferable for situations in which an 
oligo d(T) library does not yield a full-length cDNA. Genomic libraries can be 
useful for extension of sequence into 5' non-transcribed regulatory regions. 

Commercially available capillary electrophoresis systems can be used to analyze the 
15 size or confirm the nucleotide sequence of PCR or sequencing products. For ex- 
■ ample, capillary sequencing can employ flowable polymers for electrophoretic 
separation, four different fluorescent dyes (one for each nucleotide) that are laser 
activated, and detection of the emitted wavelengths by a charge coupled device 
camera. Output/light intensity can be converted to electrical signal using appropriate 
20 software (e.g. GENOTYPER and Sequence NAVIGATOR, Perkin Elmer), and the 
entire process from loading of samples to computer analysis and electronic data 
display can be computer controlled. Capillary electrophoresis is especially preferable 
for the sequencing of small pieces of DNA that might be present in limited amounts 
in a particular sample. 

25 

Obtaining Polynucleotides 

Receptor tyrosine phosphatase polypeptides can be obtained, for example, by' 
purification from human cells, by expression of receptor tyrosine phosphatase 
30 polynucleotides, or by direct chemical synthesis. 
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Protein purification 

Receptor tyrosine phosphatase polypeptides can be purified from any human cell 
which expresses the receptor, including host cells which have been transfected with 
5 receptor tyrosine phosphatase polynucleotides. A purified receptor tyrosine phos- 
phatase polypeptide is separated from other compounds that normally associate with 
the receptor tyrosine phosphatase polypeptide in the cell, such as certain proteins, 
carbohydrates, or lipids, using methods well-known in the art. Such methods 
include, but are not limited to, size exclusion chromatography, ammonium sulfate 
10 fractionation, ion exchange chromatography, affinity chromatography, and 
preparative gel electrophoresis. 

Receptor tyrosine phosphatase polypeptide can be conveniently isolated as a complex 
with its associated G protein, as described in the specific examples, below. A 
15 preparation of purified receptor tyrosine phosphatase polypeptides is at least .80% 
pure; preferably, the preparations are 90%, 95%, or 99% pure: Purity of the 
preparations can be assessed by any means known in the art, such as SDS- 
polyacrylamide gel electrophoresis. 
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Expression of polynucleotides 

To express a human receptor tyrosine phosphatase polynucleotide, the polynucleotide 
can be inserted into an expression vector which contains the necessary elements for 
5 the transcription and translation of the inserted coding sequence. Methods which are 
well known to those skilled in the art can be used to construct expression vectors 
containing sequences encoding receptor tyrosine phosphatase polypeptides and 
appropriate transcriptional and translational control elements. These methods 
include in vitro recombinant DNA techniques, synthetic techniques, and in vivo 
10 genetic recombination. Such techniques are described, for example, in Sambrook et 
al (1989) and in Ausubel et ah, Current Protocols in Molecular Biology, 
John Wiley & Sons, New York, N.Y., 1989. 

A variety of expression vector/host systems can be utilized to contain and express 
15 sequences encoding a human receptor tyrosine phosphatase polypeptide. These 
include, but are not limited to, microorganisms, such as bacteria transformed with 
recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast 
transformed with yeast expression vectors, insect cell systems infected with virus 
expression vectors (e.g., baculovirus), plant cell systems transformed with virus 
20 expression vectors (e.g., cauliflower mosaic virus, CaMV; tobacco mosaic virus, 
TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids), or animal 
cell systems. 

The control elements or regulatory sequences are those non-translated regions of the 
25 vector — enhancers, promoters, 5' and 3' untranslated regions - which interact with 
host cellular proteins to carry out transcription and translation. Such elements can 
vary in their strength and specificity. Depending on the vector system and host 
utilized, any number of suitable transcription and translation elements, including' 
constitutive and inducible promoters, can be used. For example, when cloning in 
30 bacterial systems, inducible promoters such as the hybrid lacZ promoter of the 
BLUESCRIPT phagemid (Stratagene, LaJolla, Calif.) or pSPORTl plasmid (Life 
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Technologies) and the like can be used. The baculovirus polyhedrin promoter can 
be used in insect cells. Promoters or enhancers derived from the genomes of plant 
cells (e.g., heat shock, RUBISCO, and storage protein genes) or from plant viruses 
. (e.g., viral promoters or leader sequences) can. be cloned into the vector. In 
5 mammalian cell systems, promoters from mammalian genes or from mammalian 
viruses are preferable. If it is necessary to generate a cell line that contains multiple 
copies of a nucleotide sequence encoding a human receptor tyrosine phosphatase 
polypeptide, vectors based on SV40 or EBV can be used with an appropriate 
selectable marker. 

10 

Bacterial and yeast expression systems 

In bacterial systems, a number of expression vectors can be selected depending upon 
the use intended for the receptor tyrosine phosphatase polypeptide. For example, 

15 when a large quantity of a human receptor tyrosine phosphatase polypeptide is 
needed for the induction of antibodies, vectors which direct high level expression of 
fusion proteins that are readily purified can be used. Such vectors include, but are 
not limited to, multifunctional E. coli cloning and expression vectors such as 
. BLUESCREPT (Stratagene). In a BLUESCRIPT vector, a sequence encoding the 

20 receptor tyrosine phosphatase polypeptide can be ligated into the vector in frame 
with sequences for the amino-terminal Met and the subsequent 7 residues of 
p-gaiactosidase so that a hybrid protein is produced. pIN vectors (Van Heeke & 
Schuster, J.. Biol Chem. 264 \ 5503-55.09, 1989) or pGEX vectors (Promega, 
Madison, Wis.) also can be used to express foreign polypeptides as fusion proteins 

25 with glutathione S-transferase (GST). In general, such fusion proteins are soluble 
and can easily be purified from lysed cells by adsorption to glutathione-agarose 
beads followed by elution in the presence of free glutathione. Proteins made in such 
systems can be designed to include heparin, thrombin, or factor Xa protease cleavage' 
sites so that the cloned polypeptide of interest can be released from the GST moiety 

30 at will. 



WO 03/033688 



PCT7EP02/11473 



-25- 

In the yeast Saccharomyces cerevisiae, a number of vectors containing constitutive 
or inducible promoters such as alpha factor, alcohol oxidase, and PGH can be used. 
For reviews, see Ausubel et al. (1989) and Grant et aL, Methods Enzymol 153, 
516-544, 1987. 

5 

Plant and insect expression systems 

If plant expression vectors are used, the expression of sequences encoding receptor 
tyrosine phosphatase polypeptides can be driven by any of a number of promoters. 

10 For example, viral promoters such as the 35S and 19S promoters of CaMV can be 
used alone or in combination with the omega leader sequence from TMV 
(Takamatsu, EMBOJ. 6, 307-311, 1987). Alternatively, plant promoters such as the 
small subunit of RUBISCO or heat shock promoters can be used (Coruzzi et al, 
EMBO J. 3, 1671-1680, 1984; Broglie et aL, Science 224, 838-843, 1984; Winter et 

15 al, Results Probl Cell Differ. 1 7, 85-105, 1991). These constructs can be introduced 
into plant cells by direct DNA transformation or by pathogen-mediated transfection. 
Such techniques are described in a number of generally available reviews (e.g., 
Hobbs or Murray, in McGraw Hill Yearbook of Science and Technology, 
McGraw Hill, New York, N.Y., pp. 191-196, 1992). 

20 

An insect system also can be used to express a human receptor tyrosine phosphatase 
polypeptide. For example, in one such system Autographa californica nuclear 
polyhedrosis virus (AcNPV) is used as a vector to express foreign genes in 
Spodopterafrugiperda cells or in Trichoplusia larvae. Sequences encoding receptor 

25 tyrosine phosphatase polypeptides can be- cloned into a non-essential region of the 
virus, such as the polyhedrin gene, and placed under control of the polyhedrin 
promoter. Successful insertion of receptor tyrosine phosphatase polypeptides will 
render the polyhedrin gene inactive and produce recombinant virus lacking coat' 
protein. The recombinant viruses can then be used to infect S. frugiperda cells or 

30 Trichoplusia larvae in which receptor tyrosine phosphatase polypeptides can be 
expressed (Engelhard etal, Proc. Nat Acad Sci. 91, 3224-3227, 1994). 
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Mammalian expression systems 

A number of viral-based expression systems can be used to express receptor tyrosine 
5 . phosphatase polypeptides in mammalian host cells. For example, if an adenovirus is 
used as an expression vector, sequences encoding receptor tyrosine phosphatase 
polypeptides can be ligated into an adenovirus transcription/translation complex 
comprising the late promoter and tripartite leader sequence. Insertion in a 
non-essential El or E3 region of the viral genome can be used to obtain a viable virus 
10 which is capable of expressing a human receptor tyrosine phosphatase polypeptide 
in infected host cells (Logan & Shenk, Proc. Natl Acad. Set 81, 3655-3659, 1984). 
If desired, transcription enhancers, such as the Rous sarcoma virus (RSV) enhancer, 
can be used to increase expression in mammalian host cells. 

15 Human artificial chromosomes (HACs) also can be used to deliver larger fragments 
of DNA than caii be contained and expressed in a plasmid. HACs of 6M to 10M are 
constructed and delivered to cells via conventional delivery methods (e.g., liposomes, 
polycationic amino polymers, or vesicles). 

20 Specific initiation signals also can be used to achieve more efficient translation of 
sequences encoding receptor tyrosine phosphatase polypeptides. Such signals in- 
clude the ATG initiation codon and adjacent sequences. In cases where sequences 
encoding a human receptor tyrosine phosphatase polypeptide, its initiation codon, 
and upstream sequences are inserted into the appropriate expression vector, no 

25 additional transcriptional or translational control signals may be needed. However, 
in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
translational control signals (including the ATG initiation codon) should be provided. 
The initiation codon should be in the correct reading frame to ensure translation of 
the entire insert. Exogenous translational elements and initiation codons can be of 

30 various origins, both natural and synthetic. The efficiency of expression can be 
enhanced by the inclusion of enhancers which are appropriate for the particular cell 
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system which is used (see Scharf et al> Results Probl Cell Differ. 20, 125-162, 
1994). 

Host cells 

5 

A host cell strain can be chosen for its ability to modulate the expression of the 
inserted sequences or to process the expressed receptor tyrosine phosphatase poly- 
peptide in the desired fashion. Such modifications of the polypeptide include, but 
are not limited to, acetylation, carboxylation, glycosylation, phosphorylation, lipi- 

10 dation, and acylation. Post-translational processing which cleaves a "prepro" form of 
the polypeptide also can be used to facilitate correct insertion, folding and/or 
function. Different host cells that have specific cellular machinery and characteristic 
mechanisms for post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and 
WI38), are available from the American Type Culture Collection (ATCC; 10801 . 

15 University Boulevard, Manassas, VA 20110-2209) and can be chosen to ensure the 
.correct modification and processing of the foreign protein. 

Stable expression is preferred for long-term, high-yield production of recombinant 
• proteins. For example, cell lines which stably express receptor tyrosine phosphatase 

20 polypeptides can be transformed using expression vectors which can contain viral 
origins of replication and/or endogenous expression elements and a selectable marker 
gene on the same or on a separate vector. Following the introduction of the vector, 
cells can be allowed to grow for 1-2 days in an enriched medium before they are 
switched to a selective medium. The purpose of the selectable marker is to confer 

25 resistance to selection, and its presence allows growth and recovery of cells which 
successfully express the introduced receptor tyrosine phosphatase sequences. 
Resistant clones of stably transformed cells can be proliferated using tissue culture 
techniques appropriate to the cell type. See, for example, ANIMAL CELL CULTURE," 
R.L Freshney, ed., 1986. 

30 
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Any number of selection systems can be used to recover transformed cell lines. 
These include, but are not limited to, the herpes simplex virus thymidine kinase 
(Wigler et al, Cell 11, 223-32, 1977) and adenine phosphoribosyltransferase (Lowy 
et al, Cell 22, 817-23, 1980) genes that can be employed in tic or aprf cells, 
5 respectively. Also, antimetabolite, antibiotic, or herbicide resistance can be used as 
the basis for selection. For example, dhfr confers resistance to methotrexate (Wigler 
et al., Proc. Natl. Acad. Sci. 77, 3567-70, 1980), npt confers resistance to the 
. aminoglycosides, neomycin and G-418 (Colbere-Garapin et al, J. Mol Biol. 150, 
1-14, 1981), and ah and pat confer resistance to chlorsulfuron and phosphinotricin 

10 acetyltransferase, respectively (Murray, 1992, supra). Additional selectable genes 
have been described. For example, trpB allows cells to utilize indole in place of 
tryptophan, or hisD, which allows cells to utilize histinol in place of histidine 
(Hartman & Mulligan, Proc. Natl. Acad. Sci. 85, 8047-51, 1988). Visible markers 
such as anthocyanins, P-glucuronidase and its substrate GUS, and luciferase and its 

15 substrate luciferin, can be used to identify transfomiants and to quantify the amount 
of transient or stable protein expression attributable to a specific vector system 
(Rhodes et ah, Methods Mol. Biol. 55, 121-131, 1995). 

Detecting expression 

20 

Although the presence of marker gene expression suggests that the receptor tyrosine 
phosphatase polynucleotide is also present, its presence and expression may need to 
be confirmed. For example, if a sequence encoding a human receptor tyrosine 
phosphatase polypeptide is inserted within a marker gene sequence, transformed cells 

25 containing sequences which encode an receptor tyrosine phosphatase polypeptide 
can be identified by the absence of marker gene function. Alternatively, a marker 
gene can be placed in tandem with a sequence encoding a receptor tyrosine 
phosphatase polypeptide under the control of a single promoter. Expression of the' 
marker gene in response to induction or selection usually indicates expression of the 

3 0 receptor tyrosine phosphatase polynucleotide. 
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Alternatively, host cells which contain a human receptor tyrosine phosphatase 
polynucleotide and which express a human receptor tyrosine phosphatase . poly- 
peptide can be identified by a variety of procedures known to those of skill in the art. 
These procedures include, but are not limited to, DNA-DNA or DNA-RNA 

5 hybridizations and protein bioassay or immunoassay techniques that include 
membrane, solution, or chip-based technologies for the detection and/or 
quantification of nucleic acid or protein. For example, the presence of a polynucleo- 
tide sequence encoding an receptor tyrosine phosphatase polypeptide can be detected 
by DNA-DNA or DNA-RNA hybridization or amplification using probes or 

10 fragments or fragments of polynucleotides encoding a human receptor tyrosine phos- 
phatase polypeptide. Nucleic acid amplification-based assays involve the use of 
oligonucleotides selected from sequences encoding a receptor tyrosine phosphatase 
polypeptide to detect transformants which contain an receptor tyrosine phosphatase 
polynucleotide. 

15 

A variety of protocols for detecting and measuring the expression of a human 
receptor tyrosine phosphatase polypeptide, using either polyclonal or monoclonal 
antibodies specific for the polypeptide, are known in the art. Examples include 
enzyme-linked immunosorbent assay (ELISA), radioimmunoassay (RIA), and 

20 fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immuno- 
assay using monoclonal antibodies reactive to two non-interfering epitopes on a 
human receptor tyrosine phosphatase polypeptide can be used, or a competitive 
binding assay can be employed. These and other assays are described in Hampton et 
al 9 Serological Methods: A Laboratory Manual, APS Press, St. Paul, Minn., 

25 1990) and Maddox et al. 9 J. Exp. Med 158, 121 1-1216, 1983). 

A wide variety of labels and conjugation techniques are known by those skilled in the 
art and can be used in various nucleic acid and amino acid assays. Means for' 
producing labeled hybridization or PCR probes for detecting sequences related to 
30 polynucleotides encoding receptor tyrosine phosphatase polypeptides include oligo- 
labeling, nick translation, end-labeling, or PCR amplification using a labeled 
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nucleotide. Alternatively, sequences encoding a human receptor tyrosine phos- 
phatase polypeptide can be cloned into a vector for the production of an mRNA 
probe. Such vectors are known in the art, are commercially available, and can be 
used to synthesize RNA probes in vitro by addition of labeled nucleotides and an 
5 appropriate RNA polymerase such as T7, T3, or SP6. These procedures can be 
conducted using a variety of commercially available kits (Amersham Pharmacia 
Biotech, Promega, and US Biochemical). Suitable reporter molecules or labels, 
which can be used for ease of detection include radionuclides, enzymes, and 
fluorescent, chemiluminescent, or chromogenic agents, as well as . substrates, 
1 0 cofactors, inhibitors, magnetic particles, and the like. 

Expression and purification of polypeptides 

Host cells transformed with nucleotide sequences encoding a human receptor 
15 tyrosine phosphatase polypeptide can be cultured under conditions suitable for the 
expression and recovery of the protein from cell culture. The polypeptide produced 
by a transformed cell can be secreted or contained intracellularly depending on the 
sequence and/or the vector used. As will be understood by those of skill in the art, 
expression vectors containing polynucleotides which encode receptor tyrosine phos- 
20 phatase polypeptides can be designed to contain signal sequences which direct 
secretion of soluble receptor tyrosine phosphatase polypeptides through a prokaryotic 
or eukaryotic cell membrane or which direct the membrane insertion of membrane- 
bound receptor tyrosine phosphatase polypeptide. 

25 As discussed above, other constructions can be used to join a sequence encoding a 
human receptor tyrosine phosphatase polypeptide to a nucleotide sequence encoding 
a polypeptide domain which will facilitate purification of soluble proteins. Such 
purification facilitating domains include, but are not limited to, metal chelating' 
peptides such as histidine-tryptophan modules that allow purification on immobilized 

30 metals, protein A domains that allow purification on immobilized immunoglobulin, 
and the domain utilized in the FLAGS extension/affinity purification system 
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(Immunex Corp., Seattle, Wash.). Inclusion of cleavable linker sequences such as 
those specific for Factor Xa or enterokinase (Invitrogen, San Diego, CA) between the 
purification domain and the receptor tyrosine phosphatase polypeptide also can be 
used to facilitate purification. One such expression vector provides for expression of 
5 a fusion protein containing a human receptor tyrosine phosphatase polypeptide and 6 
histidine residues preceding a thioredoxin or an enterokinase cleavage site. The 
histidine residues facilitate purification by IMAC (immobilized metal ion affinity 
chromatography, as described in Porath et ah, Prot. Exp. Purif 3, 263-281, 1992), 
while the enterokinase cleavage site provides a means for purifying the receptor 
10 tyrosine phosphatase polypeptide from the fusion protein. Vectors that contain 
fusion proteins are disclosed in Kroll et of., DNA Cell Biol 12, 441-453, 1993. 

Chemical synthesis 

15 Sequences encoding a human receptor tyrosine phosphatase polypeptide can be 
synthesized, in whole or in part, using chemical methods well known in the art (see 
Caruthers et al, Nucl Acids Res. Symp. Ser. 215-223, 1980; Horn et al Nucl. Acids 
Res. Symp. Ser. '225-232, 1980). Alternatively, a human receptor tyrosine phos- 
phatase polypeptide itself can be produced using chemical methods to synthesize its 

20 amino acid sequence, such as by direct peptide synthesis using solid-phase 
techniques (Merrifield, J. Am. Chem. Soc. 85, 2149-2154, 1963; Roberge et al, 
Science 269, 202-204, 1995). Protein synthesis can be performed using manual 
techniques or by automation. Automated synthesis can be achieved, for example, 
using Applied Biosystems 431 A Peptide Synthesizer (Perkin Elmer). Optionally, 

25 fragments of receptor tyrosine phosphatase polypeptides can be separately 
synthesized and combined using chemical methods to produce a full-length 
molecule. 

The newly synthesized peptide can be substantially purified by preparative high 
30 performance liquid chromatography (e.g., Creighton, PROTEINS: STRUCTURES AND 
Molecular Principles, WH Freeman and Co., New York, N.Y., 1983). The 
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composition of a synthetic receptor tyrosine phosphatase polypeptide can be 
confirmed by amino acid analysis or sequencing (e.g., the Edman degradation 
procedure; see Creighton, supra). Additionally, any portion of the amino acid 
sequence, of the receptor tyrosine phosphatase polypeptide can be altered during 
5 direct synthesis and/or combined using chemical methods with sequences from other 
proteins to produce a variant polypeptide or a fusion protein. 

As will be understood by those of skill in the art, it may be advantageous to produce 
receptor tyrosine phosphatase polypeptide-encoding nucleotide sequences possessing 
10 non-naturally occurring codons. For example, codons preferred by a particular 
prokaryotic or eukaryotic host can be selected to increase the rate of protein 
expression or to produce an RNA transcript having desirable properties, such as a 
half-life which is longer than that of a transcript generated from the naturally 
occurring sequence. 

15 

The nucleotide sequences disclosed herein can be engineered using methods 
generally known in the art to alter receptor tyrosine phosphatase polypeptide- 
encoding sequences for a variety of reasons, including but not limited to, alterations 
which modify the cloning, processing, and/or expression of the polypeptide or 
20 mRNA product. DNA shuffling by random fragmentation and PCR reassembly of 
gene fragments and synthetic oligonucleotides can be used to engineer the nucleotide 
sequences. For example, site-directed mutagenesis can be used to insert new 
restriction sites, alter glycosylation patterns, change codon preference, produce splice 
variants, introduce mutations, and so forth. 

25 

Antibodies 

Any type of antibody known in the art can be generated to bind specifically to an 
epitope of a human receptor tyrosine phosphatase polypeptide. "Antibody" as used 
30 herein includes intact immunoglobulin molecules, as well as fragments thereof, such 
as Fab, F(ab% and Fv, which are capable of binding an epitope of a human receptor 
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tyrosine phosphatase polypeptide. Typically, at least 6, 8, 10, or 12 contiguous 
amino acids are required to form an epitope. However, epitopes which involve non- 
contiguous amino acids may require more, e.g., at least 15, 25, or 50 amino acids. 

5 An antibody which specifically binds to an epitope of a human receptor tyrosine 
phosphatase polypeptide can be used therapeutically, as well as in immunochemical 
assays, such as Western blots, ELISAs, radioimmunoassays, immunohistochemical 
assays, immunoprecipitations, or other immunochemical assays known in the art. 
Various immunoassays can be used to identify antibodies having the desired 
10 specificity. Numerous protocols for competitive binding or immunoradiometric 
assays are well known in the art. Such immunoassays typically involve the 
measurement of complex formation between an immunogen and an antibody that 
specifically binds to the immunogen. 

15 Typically, an antibody which specifically binds to a human receptor tyrosine phos- 
. phatase polypeptide provides a detection signal at least 5-, 10-, or 20-fold higher than 
a detection signal provided with other proteins when used in an immunochemical 
assay. Preferably, antibodies which specifically bind to receptor tyrosine phos- 
phatase polypeptides do not detect other proteins in immunochemical assays and can 

20 immunoprecipitate a human receptor tyrosine phosphatase polypeptide from solution. 

Human receptor tyrosine phosphatase polypeptides can be used to immunize a 
mammal, such as a mouse, rat, rabbit, guinea pig, monkey, or human, to produce 
polyclonal antibodies. If desired, a human receptor tyrosine phosphatase polypeptide 

25 can be conjugated to a carrier protein, such as bovine serum albumin, thyroglobulin, 
and keyhole limpet hemocyanin. Depending on the host species, various adjuvants 
can be used to increase the immunological response. Such adjuvants include, but are 
not limited to, Freund's adjuvant, mineral gels (e.g., aluminum hydroxide), and' 
surface active substances (e.g. lysolecithin, pluronic polyols, polyanions, peptides, 

30 oil emulsions, keyhole limpet hempcyanin, and dinitrophenol). Among adjuvants 
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used in humans, BCG (bacilli Calmette-Guerin) and Corynebacterium parvum are 
especially useful. 

Monoclonal antibodies which specifically bind to a human receptor tyrosine 
5 phosphatase polypeptide can be prepared using any technique which provides for the 
production of antibody molecules by continuous cell lines in culture. These 
techniques include, but are not limited to, the hybridoma technique, the human B-cell 
hybridoma technique, and the EBV-hybridoma technique (Kohler et ah, Nature 256, 
495-497, 1985; Kozbor et ah, 1 Immunol Methods 81, 31-42, 1985; Cote et ah, 
10 Proc. Natl. Acad. Set 80, 2026-2030, 1983; Cole et ah, Mol. Cell Biol 62, 109-120, 
1984). 

In addition, techniques developed for the production of "chimeric antibodies,'* the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with 

15 appropriate antigen specificity and biological activity, can be used (Morrison et ah, 
Proc. Natl Acad Sci. 81, 6851-6855, 1984; Neuberger et ah, Nature 312, 604-608, 
1984; Takeda et ah, Nature 314, 452-454, 1985). Monoclonal and other antibodies 
also can be "humanized" to prevent a patient from mounting an immune response 
against the antibody when it is used therapeutically. Such antibodies may be 

20 sufficiently similar in sequence to human antibodies to be used directly in therapy or 
may require alteration of a few key residues. Sequence differences between rodent 
■ antibodies and human sequences can be minimized by replacing residues which 
differ from those in the human sequences by site directed mutagenesis of individual 
residues or by grating of entire complementarity determining regions. Alternatively, 
' 25 humanized antibodies can be produced using recombinant methods, as described in 
GB2188638B. Antibodies that specifically bind to a human receptor tyrosine 
phosphatase polypeptide can contain antigen binding sites which are either partially 
or fully humanized, as disclosed in U.S. 5,565,332. 

30 Alternatively, techniques described for the production of single chain antibodies can . 
be adapted using methods known in the art to produce single chain antibodies that 
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specifically bind to receptor tyrosine phosphatase polypeptides. Antibodies with 
related specificity, but of distinct idiotypic composition, can be generated by chain 
shuffling from random combinatorial immunoglobin libraries (Burton, Proc. Natl 
Acad Sci. 88, 1 1 120-23, 1991). 

5 

Single-chain antibodies also can be constructed using a DNA amplification method, 
such as PCR, using hybridoma cDNA as a template (Thirion et al, 1996, Eur. J. 
Cancer Prev. 5, 507-11). Single-chain antibodies can be mono- or bispecific, and 
can be bivalent or tetravalent. Construction of tetravalent, bispecific single-chain 
10 antibodies is taught, for example, in Coloma & Moirison, 1997, Nat. Biotechnol 15, 
159-63. Construction of bivalent, bispecific single-chain antibodies is taught in 
Mallender & Voss, 1994, J. Biol Chem. 269, 199-206. 

A nucleotide sequence encoding a single-chain antibody can be constructed using 
15 manual or automated nucleotide synthesis, cloned into an expression construct using . 
standard recombinant DNA methods, and introduced into a cell to express the coding 
sequence, as described below. Alternatively, single-chain antibodies can be 
produced directly using, for example, filamentous phage technology (Verhaar et al, 
1995, Int. J. Cancer 61, 497-501; Nicholls et al, 1993, J. Immunol Meth. 165, 81- 
20 91). 

Antibodies which specifically bind to receptor tyrosine phosphatase polypeptides 
also can be produced by inducing in vivo production in the lymphocyte population or 
by screening immunoglobulin libraries or panels of highly specific binding reagents 
25 as disclosed in the literature (Orlandi et al, Proc. Natl Acad. Sci. 86, 3833-3837, 
1989; Winter et al, Nature 349, 293-299, 1991). 

Other types of antibodies can be constructed and used therapeutically in methods of 
the invention. For example, chimeric antibodies can be constructed as disclosed in 
30 WO 93/03151. Binding proteins which are derived from immunoglobulins and 
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which are multivalent and multispecific, such as the "diabodies" described in WO 
94/13804, also can be prepared. 

Antibodies according to the invention can be purified by methods well known in the 
5 art. For example, antibodies can be affinity purified by passage over a column to 
which a human receptor tyrosine phosphatase polypeptide is bound. The bound 
antibodies can then be eluted from the column using a buffer with a high salt 
concentration. 

10 Antisense oligonucleotides 

Antisense oligonucleotides are nucleotide sequences which are complementary to a 
specific DNA or RNA sequence. Once introduced into a cell, the complementary 
nucleotides combine with natural sequences produced by the cell to form complexes 

1 5 and block either transcription or translation. Preferably, an antisense oligonucleotide 
is at least 11 nucleotides in length, but can be at least 12, 15, 20, 25, 30, 35, 40, 45, 
or 50 or more nucleotides long. Longer sequences also can be used. Antisense 
oligonucleotide molecules can be provided in a DNA construct and introduced into a 
cell as described above to decrease the level of receptor tyrosine phosphatase gene 

20 products in the cell. 

Antisense oligonucleotides can be deoxyribonucleotides, ribonucleotides, or a 
combination of both. Oligonucleotides can be synthesized manually or by an auto- 
mated synthesizer, by covalently linking the 5' end of one nucleotide with the 3' end 

25 of another nucleotide with non-phosphodiester internucleotide linkages such 
alkylphosphonates, phosphorothioates, phosphorodithioates, alkylphosphonothioates, 
alkylphosphonates, phosphoramidates, phosphate esters, carbamates, acetamidate, 
carboxymethyl esters, carbonates, and phosphate triesters. See Brown, Meth Mol ' 
Biol 20, 1-8, 1994; Sonveaux, Meth Mol Biol 26, 1-72, 1994; Uhlmann et al, 

30 Chem. Rev. 90, 543-583, 1990. 
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Modifications of receptor tyrosine phosphatase gene expression can be obtained by 
designing antisense oligonucleotides that will form duplexes to the control, 5', or 
regulatory regions of the receptor tyrosine phosphatase gene. Oligonucleotides 
derived from the transcription initiation site, e.g., between positions -10. and +10 
5 from the start site, are preferred. Similarly, inhibition can be achieved using "triple 
helix" base-pairing methodology. Triple helix pairing is useful because it causes 
inhibition of the ability of the double helix to open sufficiently for the binding of 
polymerases, transcription factors, or chaperons. Therapeutic advances using triplex 
DNA have been described in the literature (e.g., Gee et aL 9 in Huber & Carr, 
10 Molecular and Immunologic Approaches, Futura Publishing Co., Mt. Kisco, 
N.Y., 1994). An antisense oligonucleotide also can be designed to block translation 
of mRNA by preventing the transcript from binding to ribosomes. 

Precise complementarity is not required for successful complex formation between 
15 an antisense oligonucleotide and the complementary sequence of a human receptor 
tyrosine phosphatase polynucleotide. Antisense oligonucleotides which comprise, 
for example, 2, 3, 4, or 5 or more stretches of contiguous nucleotides which are pre- 
cisely complementary to an receptor tyrosine phosphatase polynucleotide, each 
separated by a stretch of contiguous nucleotides which are not complementary to 
20 adjacent receptor tyrosine phosphatase nucleotides, can provide sufficient targeting 
specificity for receptor tyrosine phosphatase mRNA. Preferably, each stretch of 
complementary contiguous nucleotides is at least 4, 5, 6, 7, or 8 or more nucleotides 
in length. Non-complementary intervening sequences are preferably 1, 2, 3, or 4 
nucleotides in length. One skilled in the art can easily use the calculated melting 
25 point of an antisense-sense pair to determine the degree of mismatching which will 
be tolerated between a particular antisense oligonucleotide and a particular receptor 
tyrosine phosphatase polynucleotide sequence. 

Antisense oligonucleotides can be modified without affecting their ability to 
30 hybridize to a human receptor tyrosine phosphatase polynucleotide. These 
modifications can be internal or at one or both ends of the antisense molecule. For 
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example, interaucleoside phosphate linkages can be modified by adding cholesteryl 
or diamine moieties with varying numbers of carbon residues between the amino 
groups and terminal- ribose. Modified bases and/or sugars, such as arabinose instead 
of ribose, or a 3\ 5'-substituted oligonucleotide in which the 3' hydroxyl group or 
the 5' phosphate group are substituted, also can be employed in a modified antisense 
oligonucleotide. These modified oligonucleotides can be prepared by methods well 
known in the art. See, e.g., Agrawal et aL, Trends Biotechnol 10, 152-158, 1992; 
Uhlmann et aL, Chem. Rev, 90, 543-584, 1990; Uhlmann et aL, Tetrahedron. Lett. 
215, 3539-3542, 1987. 

Ribozymes 

Ribozymes are RNA molecules with catalytic activity. See, e.g., Cech, Science 236, 
1532-1539; 1987; Cech, Ann. Rev. Biochem. 59, 543-568; 1990, Cech, Curr. Opin. 
Struct. Biol 2, 605-609; 1992, Couture & Stinchcomb, Trends Genet. 12, 510-515, 
1996. Ribozymes can be used to inhibit gene function by cleaving an RNA sequence, 
as is known in the art {e.g., Haseloff et aL, U.S. Patent 5,641,673). The mechanism 
of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. 
Examples include engineered hammerhead motif ribozyme molecules that can 
specifically and efficiently catalyze endonucleolytic cleavage of specific nucleotide 
sequences. 

The coding sequence of a human receptor tyrosine phosphatase polynucleotide can 
be used to generate ribozymes that will specifically bind to mRNA transcribed from 
the receptor tyrosine phosphatase polynucleotide. Methods of designing and 
constructing ribozymes which can cleave other RNA molecules in trans in a highly 
sequence specific manner have been developed and described in the art (see Haseloff' 
et aL Nature 334, 585-591, 1988). For example, the cleavage activity of ribozymes 
can be targeted to specific RNAs by engineering a discrete "hybridization" region 
into the ribozyme. The hybridization region contains a sequence complementary to 
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the target RNA and thus specifically hybridizes with the target (see, for example, 
Gerlach et ah, EP 321,201). 

Specific ribozyme cleavage sites within a human receptor tyrosine phosphatase RNA 
5 target can be identified by scanning the target molecule for ribozyme cleavage sites 
which include the following sequences: GUA, GUU, and GUC. Once identified, 
short RNA sequences of between 15 and 20 ribonucleotides corresponding to the 
region of the target RNA containing the cleavage site can be evaluated for secondary 
structural features which may render the target inoperable. Suitability of candidate 

10 receptor tyrosine phosphatase RNA targets also can be evaluated by testing 
accessibility to hybridization with complementary oligonucleotides using ribo- 
nuclease protection assays. Longer complementary sequences can be used to 
increase the affinity of the hybridization sequence for the target. The hybridizing and 
cleavage regions of the ribozyme can be integrally related such that upon hybridizing 

15 to the target RNA through the complementary regions, the catalytic region of the 
ribozyme can cleave the target. 

Ribozymes can be introduced into cells as part of a DNA construct. Mechanical 
methods, such as microinjection, liposome-mediated transfection, electroporation, or 

20 • calcium phosphate precipitation, can be used to introduce a ribozyme-containing 
DNA construct into cells in which it is desired to decrease receptor tyrosine 
phosphatase expression. Alternatively, if it is desired that the cells stably retain the 
DNA construct, the construct can be supplied on a plasmid and maintained as a 
separate element or integrated into the genome of the cells, as is known in the art. A 

25 • ribozyme-encoding DNA construct can include transcriptional regulatory elements, 
such as a promoter element, an enhancer or UAS element, and a transcriptional 
terminator signal, for controlling transcription of ribozymes in the cells. 

As taught in Haseloff et ah, U.S. Patent 5,641,673, ribozymes can be engineered so 
30 that ribozyme expression will occur in response to factors that induce expression of a 
target gene. Ribozymes also can be engineered to provide an additional level of 
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regulation, so that destruction of mRNA occurs only when both a ribozyme and a 
target gene are induced in the cells. 

Differentially expressed genes 

5 

Described herein are methods for the identification of genes whose products interact 
with human receptor tyrosine phosphatase. Such genes may represent genes that are 
differentially expressed in disorders including, but not limited to, diabetes, CNS 
disorders, COPD, and cardiovascular disorders. Further, such genes may represent 

10 genes that are differentially regulated in response to manipulations relevant to the 
progression or treatment of such diseases. Additionally, such genes may have a 
temporally modulated expression, increased or decreased at different stages of tissue 
or organism development. A differentially expressed gene may also have its ex- 
pression modulated under control versus experimental conditions. In addition, the 

15 human receptor tyrosine phosphatase gene or gene product may itself be tested for 
differential expression. 

The degree to which expression differs in a normal versus a, diseased state need only 
be large enough to be visualized via standard characterization techniques such as 
20 differential display techniques. Other such standard characterization techniques by 
which expression differences may be visualized include but are not limited to, 
quantitative RT (reverse transcriptase), PCR, and Northern analysis. 

To identify differentially expressed genes total RNA or, preferably, mRNA is 
25 isolated from tissues of interest. For example, RNA samples are obtained from 
tissues of experimental subjects and from corresponding tissues of control subjects. 
Any RNA isolation technique that does not select against the isolation of mRNA may 
be utilized for the purification of such RNA samples. See, for example, Ausubel et 
a/., ed., Current Protocols in Molecular Biology, John Wiley & Sons, Inc. 
30 New York, 1987-1993. Large numbers of tissue samples may readily be processed 
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using techniques well known to those of skill in the art, such as, for example, the 
single-step RNA isolation process of Chomczynski, U.S. Patent 4,843,155. 

Transcripts within the collected RNA samples that represent RNA produced by 
5 differentially expressed genes are identified by methods well known to those of skill 
in the art. They include, for example, differential screening (Tedder et al, Proc. 
Natl Acad. Set U.S.A. 55, 208-12, 1988), subtractive hybridization (Hedrick et a/., 
Nature 308, 149-53; Lee et al, Proc. Natl Acad. Sci. U.S.A. 88, 2825, 1984), and, 
preferably, differential display (Liang & Pardee, Science 257, 967-71, 1992; U.S. 
10 Patent 5,262,311). 

The differential expression information may itself suggest relevant methods for the 
treatment of disorders involving the human receptor tyrosine phosphatase. For 
example, treatment may include a modulation of expression of the differentially ex- 
15 pressed genes and/or the gene encoding the human receptor tyrosine phosphatase. 
The differential expression information may indicate whether the expression or 
activity of the differentially expressed gene or gene product or the human receptor 
tyrosine phosphatase gene or gene product are up-regulated or down-regulated. 

20 Screening methods 

The invention provides assays for screening test compounds that bind to or modulate 
the activity of a human receptor tyrosine phosphatase polypeptide or a human 
receptor tyrosine phosphatase polynucleotide. A test compound preferably binds to a 
25 " human receptor tyrosine phosphatase polypeptide or polynucleotide. More prefer- 
ably, a test compound decreases or increases enzymatic activity by at least about 10, 
preferably about 50, more preferably about 75, 90, or 100% relative to the absence of 
the test compound. 
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Test compounds 

Test compounds can be pharmacologic agents already known in the art or can be 
compounds previously unknown to have any pharmacological activity. The com- 
5 pounds can be naturally occurring or designed in the laboratory. They can be 
isolated from microorganisms, animals, or plants, and can be produced re- 
combinantly, or synthesized by chemical methods known in the art. If desired, test 
compounds can be obtained using any of the numerous combinatorial library 
methods known in the art, including but not limited to, biological libraries, spatially 

10 addressable parallel solid phase or solution phase libraries, synthetic library methods 
requiring deconvolution, the "one-bead one-compound" library method, and 
synthetic library methods using affinity chromatography selection. The biological 
library approach is limited to polypeptide libraries, while the other four approaches 
are applicable to polypeptide, non-peptide oligomer, or small molecule libraries of 

15 compounds. See Lam, Anticancer Drug Des. 12, 145, 1997. 

Methods for the synthesis of molecular libraries are well known in the art (see, for 
example, DeWitt et al, Proc. Natl. Acad. Scl U.S.A. 90, 6909, 1993; Erb et al Proc. 
Natl Acad. Set U.S.A. 91, 11422, 1994; Zuckermann et al, J. Med. Chem. 37, 2678, 

20 1994; Cho et al, Science 261, 1303, 1993; Carell et al, Angew. Chem. Int. Ed. Engl 
33, 2059, 1994; Carell et al, Angew. Chem. Int. Ed Engl 33, 2061; Gallop et al, J. 
Med. Chem. 37, 1233, 1994). Libraries of compounds can be presented in solution 
(see, e.g., Houghten, BioTechniques 13, 412-421, 1992), or on beads (Lam, Nature 
354, 82-84, 1991), chips (Fodor, Nature 364, 555-556, 1993), bacteria or spores 

25 (Ladner, U.S. Patent 5,223,409), plasmids (Cull et al, Proc. Natl. Acad. Sci. U.S.A. 
89, 1865-1869, 1992), or phage (Scott & Smith, Science 249, 386-390, 1990; Devlin, 
Science 249, 404-406, 1990); Cwirla et al, Proc. Natl. Acad. Scl 97, 6378-6382, 
1990; Felici, J. Mol Biol 222, 301-310, 1991; and Ladner, U.S. Patent 5,223,409). 
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High through-put screening 

Test compounds can be screened for the ability to bind to receptor tyrosine 
phosphatase polypeptides . or polynucleotides or to affect receptor tyrosine 
5 phosphatase activity or receptor tyrosine phosphatase gene expression using high 
throughput screening. Using high throughput screening, many discrete compounds 
can be tested in parallel so that large numbers of test compounds can be quickly 
screened. The most widely established techniques utilize 96-well microtiter plates. 
The wells of the microtiter plates typically require assay volumes that range from 50 
10 to 500 [il. In addition to the plates, many instruments, materials, pipettors, robotics, 
plate washers, and plate readers are commercially available to fit the 96-well format. 

Alternatively, "free format assays," or assays that have no physical barrier between 
samples, can be used. For example, an assay using pigment cells (melanocytes) in a 

15 simple homogeneous assay for combinatorial peptide libraries is described by 
Jayawickreme et al, Proc. Natl Acad, Sci. U.S.A. 19, 1614-18 (1994). The cells are 
placed under agarose in petri dishes, then beads that carry combinatorial compounds 
are placed on the surface of the agarose. The combinatorial compounds are partially 
released the compounds from the beads. ' Active compounds can be visualized as. 

20 dark pigment areas because, as the compounds difiuse locally into the gel matrix, the 
active compounds cause the cells to change colors. 

Another example of a free format assay is described by Chelsky, "Strategies for 
Screening Combinatorial Libraries: Novel and Traditional Approaches," reported at 

25 the First Annual Conference of The Society for Biomolecular Screening in 
Philadelphia, Pa. (Nov. 7-10, 1995). Chelsky placed a simple homogenous enzyme 
assay for carbonic anhydrase inside an agarose gel such that the enzyme in the gel 
would cause a color change throughout the gel. Thereafter, beads carrying" 
combinatorial compounds via a photolinker were placed inside the gel and the 

30 compounds were partially released by UV-light. Compounds that inhibited the 
enzyme were observed as local zones of inhibition having less color change. 
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Yet another example is described by Salmon et ah, Molecular Diversity 2, 57-63 
(1996). In this example, combinatorial libraries were screened for compounds that 
had cytotoxic effects on cancer cells growing in agar. 

5 

Another high throughput screening method is described in Beutel et ai, U.S. Patent 
5,976,813. In this method, test samples are placed in a porous matrix. One or more 
assay components are then placed within, on top of, or at the bottom of a matrix such 
as a gel, a plastic sheet, a filter, or other form of easily manipulated solid support. 
10 When samples are introduced to the porous matrix they diffuse sufficiently slowly, 
such that the assays can be performed without the test samples running together. 

Binding assays 

15 For binding assays, the test compound is preferably a small molecule that binds to 
and occupies, for example, the active site of the receptor tyrosine phosphatase 
polypeptide, . such that normal biological activity is prevented. Examples of such 
small molecules include, but are not limited to, small peptides or peptide-like 
molecules. 

20 

In binding assays, either the test compound or the receptor tyrosine phosphatase 
polypeptide can comprise a detectable label, such as a fluorescent, radioisotopic, 
chemiluminescent, or enzymatic label, such as horseradish peroxidase, alkaline phos- 
phatase, or luciferase. Detection of a test compound that is bound to the receptor 
25 tyrosine phosphatase polypeptide can then be accomplished, for example, by direct 
counting of radioemmission, by scintillation counting, or by determining conversion 
of an appropriate substrate to a detectable product. 

Alternatively, binding of a test compound to a human receptor tyrosine phosphatase 
30 polypeptide can be determined without labeling either of the interactants. For 
example, a microphysiometer can be used to detect binding of a test compound with 
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a human receptor tyrosine phosphatase polypeptide. A microphysiometer (e.g., 
' Cytosensor™) is an analytical instrument that measures the rate at which a cell 
acidifies its environment using a light-addressable potentiometric sensor (LAPS). 
Changes in this acidification rate can be used as an indicator of the interaction 
5 between a test compound and a human receptor tyrosine phosphatase polypeptide 
(McConnell etal, Science 257, 1906-1912, 1992). 

Determining the ability of a test compound to bind to a human receptor tyrosine 
phosphatase polypeptide also can be accomplished using a technology such as 

10 real-time Bimolecular Interaction Analysis (BIA) (Sjolander & Urbaniczky, Anal. 
Chem. 63, 2338-2345, 1991, and Szabo et al, Curr. Opin. Struct. Biol 5, 699-705, 
1995). BIA is a technology for studying biospecific interactions in real time, without 
labeling any of the interactants (e.g., BIAcore™). Changes in the optical 
phenomenon surface plasmon resonance (SPR) can be used as an indication of 

1 5 real-time reactions between biological molecules. 

In yet another aspect of the invention, a human receptor tyrosine phosphatase poly- 
peptide can be used as a "bait protein" in a two-hybrid assay or three-hybrid assay 
(see, e.g., U.S. Patent 5,283,317; Zervos etal, Cell 72, 223-232, 1993; Madura etal, 
20 J. Biol. Chem. 268, 12046-12054, 1993; Bartel et al, BioTechniques 14, 920-924, 
1993; Iwab.uchi et ah, Oncogene 8, 1693-1696, 1993; and Brent W094/10300), to 
identify other proteins which bind to or interact with the receptor tyrosine 
phosphatase polypeptide and modulate its activity. 

25 The two-hybrid system is based on the modular nature of most transcription factors, 
which consist of separable DNA-binding and activation domains. Briefly, the assay 
utilizes two different DNA constructs. For example, in one construct, polynucleotide 
encoding a human receptor tyrosine phosphatase polypeptide can be fused to a poly- 
nucleotide encoding the DNA binding domain of a known transcription factor (e.g., 

30 GAL-4). In the other construct a DNA sequence that encodes an unidentified protein 
("prey" or "sample") can be fused to a polynucleotide that codes for the activation 
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domain of the known transcription factor. If the "bait" and the "prey" proteins are 
able to interact in vivo to form an protein-dependent complex, the DNA-binding and 
activation domains of the transcription factor are brought into close proximity. This 
proximity allows transcription of a reporter gene (e.g., LacZ), which is operably 
5 linked to a transcriptional regulatory site responsive to the transcription factor. 
Expression of the reporter gene can be detected, and cell colonies containing the 
functional transcription factor can be isolated and used to obtain the DNA sequence 
encoding the protein that interacts with the receptor tyrosine phosphatase 
polypeptide. 

10 . . 

It may be desirable to immobilize either the receptor tyrosine phosphatase poly- 
peptide (or polynucleotide) or the test compound to facilitate separation of bound 
from unbound forms of one or both of the interactants, as well as to accommodate 
automation of the assay. Thus, either the receptor tyrosine phosphatase polypeptide 

15 (or polynucleotide) or the test compound can be bound to a solid support. Suitable 
solid supports include, but are not limited to, glass or plastic slides, tissue culture 
plates, microtiter wells, tubes, silicon chips, or particles such as beads (including, but 
not limited to, latex, polystyrene, or glass beads). Any method known in the art can 
be used to attach the enzyme polypeptide (or polynucleotide) or test compound to a 

20 solid support, including use of covalent and non-covalent linkages, passive ab- 
sorption, or pairs of binding moieties attached respectively to the polypeptide (or 
polynucleotide) or test compound and the solid support. Test compounds are 
preferably bound to the solid support in an array, so that the location of individual 
test compounds can be tracked. Binding of a test compound to a human receptor 

25 ' tyrosine phosphatase polypeptide (or polynucleotide) can be accomplished in any 
vessel suitable for containing the reactants. Examples of such vessels include 
microtiter plates, test tubes, and microcentrifuge tubes. 

In one embodiment, the receptor tyrosine phosphatase polypeptide is a fusion protein 
30 comprising a domain that allows the receptor tyrosine phosphatase polypeptide to be 
bound to a solid support.. For example, glutathione-S-transferase fusion proteins can 
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be adsorbed onto glutathione sepharose beads (Sigma Chemical, St. Louis, Mo.) or 
glutathione derivatized microtiter plates, which are then combined with the test 
compound or the test compound and the non-adsorbed receptor tyrosine phosphatase 
polypeptide; the mixture is then incubated under conditions conducive to complex 
5 formation (e.g., at physiological conditions for salt and pH). Following incubation, 
the beads or microtiter plate wells are washed to remove any unbound components. 
Binding of the interactants can be determined either directly or indirectly, as 
described above. Alternatively, the complexes can be dissociated from the solid 
support before binding is determined. 

10 

Other techniques for immobilizing proteins or polynucleotides on a solid support also 
can be used in the screening assays of the invention. For example, either a human 
receptor tyrosine phosphatase polypeptide (or polynucleotide) or a test compound 
can be immobilized utilizing conjugation of biotin and streptavidin. Biotinylated 

15 receptor tyrosine phosphatase polypeptides (or polynucleotides) or test compounds 
can be prepared from biotin-NHS(N- hydroxysuccinimide) using techniques well 
known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, 111.) and 
immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). 
Alternatively, antibodies which specifically bind to a receptor tyrosine phosphatase 

20 polypeptide, polynucleotide, or a test compound, but which do not interfere with a 
desired binding site, such as the active site of the receptor tyrosine phosphatase 
polypeptide, can be derivatized to the wells of the plate. Unbound target or protein 
can be trapped in the wells by antibody conjugation. 

25 Methods for detecting such complexes, in addition to those described above for the 
GST-immobilized complexes, include immunodetection of complexes using anti- 
bodies which specifically bind to the receptor tyrosine phosphatase polypeptide or 
test compound, enzyme-linked assays which rely on detecting an activity of the: 
• receptor tyrosine phosphatase polypeptide, and SDS gel electrophoresis under non- 
30 reducing conditions. 
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Screening for test compounds which bind to a human receptor tyrosine phosphatase 
polypeptide or polynucleotide also can be carried out in an intact cell. Any cell 
which comprises a receptor tyrosine phosphatase polypeptide or polynucleotide can 
be used in a cell-based assay system. A receptor tyrosine phosphatase polynucleotide 
5 can be naturally occurring in the cell or can be introduced using techniques such as 
those described above. Binding of the test compound to a receptor tyrosine phos- 
phatase polypeptide or polynucleotide is determined as described above. 

Enzymatic activity 

10 

Test compounds can be tested for the ability to increase or decrease the enzymatic 
activity of a human receptor tyrosine phosphatase polypeptide. Enzymatic activity 
can be measured, for example, as described in Calvert-Evers & Hammond, Cell Biol 
Int. 24, 559-68, 2000. 

15 

Enzyme assays can be carried out after contacting either a purified receptor tyrosine 
phosphatase polypeptide, a cell membrane preparation, or an intact cell with a test 
compound. A test compound that decreases enzymatic activity of a human receptor 
tyrosine phosphatase polypeptide by at least about 10, preferably about 50, more 

20 preferably about 75, 90, or 100% is identified as a potential therapeutic agent for 
decreasing receptor tyrosine phosphatase activity. A test compound which increases 
enzymatic activity of a human receptor tyrosine phosphatase polypeptide by at least 
about 10, preferably about 50, more preferably about 75, 90, or 100% is identified as 
a potential therapeutic agent for increasing human receptor tyrosine phosphatase 

25 activity. 

Gene expression 

In another embodiment, test compounds that increase or decrease receptor tyrosine 
30 phosphatase gene expression are identified. A receptor tyrosine phosphatase poly- 
nucleotide is contacted with a test compound, and the expression of an RNA or 
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polypeptide product of the receptor tyrosine phosphatase polynucleotide is 
determined. The level of expression of appropriate mRNA or polypeptide in the 
presence of the test compound is compared to the level of expression of mRNA or 
polypeptide in the absence of the test compound. The test compound can then be 
5 identified as a modulator of expression based on this comparison. For example, 
when expression of mRNA or polypeptide is greater in the presence of the test 
compound than in its absence, the test compound is identified as a stimulator or en- 
hancer of the mRNA or polypeptide expression. Alternatively, when expression of 
the mRNA or polypeptide is less in the presence of the test compound than in its 
10 absence, the test compound is identified as an inhibitor of the mRNA or polypeptide 
expression. 

The level of receptor tyrosine phosphatase mRNA or polypeptide expression in the 
cells can be determined by methods well known in the art for detecting mRNA or 

15 polypeptide. Either qualitative or quantitative methods can be used. The presence of 
polypeptide products of a human receptor tyrosine phosphatase polynucleotide can 
be determined, for example, using a variety of techniques known in the art, including 
immunochemical methods such as radioimmunoassay, Western blotting, and 
immunohistochemistxy. Alternatively, polypeptide synthesis can be determined in 

20 vivo, in a cell culture, or in an in vitro translation system by detecting incorporation 
of labeled amino acids into a human receptor tyrosine phosphatase polypeptide. 

Such screening can be carried out either in a cell-free assay system or in an intact 
cell. Any cell that expresses a human receptor tyrosine phosphatase polynucleotide 
25 can be used in a cell-based assay system. The receptor tyrosine phosphatase poly- 
nucleotide can be naturally occurring in the cell or can be introduced using 
techniques such as those described above. Either a primary culture or an established 
cell line, such as CHO or human embryonic kidney 293 cells, can be used. 
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Pharrnaceurtcal compositions 

The invention also provides pharmaceutical compositions that can be administered to 
a patient to achieve a therapeutic effect. Pharmaceutical compositions of the in- 
5 vention can comprise, for example, a human receptor tyrosine phosphatase 
polypeptide, receptor tyrosine phosphatase polynucleotide, ribozymes or antisense 
oligonucleotides, antibodies which specifically bind to a receptor tyrosine 
phosphatase polypeptide, or mimetics, activators, or inhibitors of a human receptor 
tyrosine phosphatase polypeptide activity. The compositions can be administered 
10 alone or in combination with at least one other agent, such as stabilizing compound, 
which can be administered in any sterile, biocompatible pharmaceutical canier, in- 
cluding, but not limited to, saline, buffered saline, dextrose, and water. The compo- 
sitions can be administered to a patient alone, or in combination with other agents, 
drugs or hormones. 

15 

In addition to the active ingredients, these pharmaceutical compositions can contain 
suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries 
that facilitate processing of the active compounds into preparations which can be 
used pharmaceutical^. Pharmaceutical compositions of the invention can be 

20 administered by any number of routes including, but not limited to, oral, intravenous, 
intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, 
subcutaneous, intraperitoneal, intranasal, parenteral, topical, sublingual, or rectal 
means. Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutical^ acceptable carriers well known in the art in dosages suitable for 

25 oral administration. Such carriers enable the pharmaceutical compositions to be 
formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, 
suspensions, and the like, for ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combination of 
30 active compounds with solid excipient, optionally grinding a resulting mixture, and 
processing the mixture of granules, after adding suitable auxiliaries, if desired, to 
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obtain tablets or dragee cores. Suitable excipients are carbohydrate or protein fillers, 
such as sugars, including lactose, sucrose, mannitol, or sorbitol; starch from corn, 
wheat, rice, potato, or other plants; cellulose, such as methyl cellulose, hydroxyprop- 
ylmethyl-cellulose, or sodium carboxymethylcellulose; gums including arabic and 
5 tragacanth; and proteins such as gelatin and collagen. Lf desired, disintegrating or 
solubilizing agents can be added, such as the cross-linked polyvinyl pyrrolidone, 
agar, alginic acid, or a salt thereof, such as sodium alginate. 

Dragee cores can be used in conjunction with suitable coatings, such as concentrated 
10 sugar solutions, which also can contain gum arabic, talc, polyvinylpyrrolidone, 
carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and 
suitable organic solvents or solvent mixtures. Dyestuffs or pigments can be added to 
the tablets or dragee coatings for product identification or to characterize the quantity 
of active compound, Le., dosage. 

15 

Pharmaceutical preparations that can be used orally include push-fit capsules made of 
gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as 
glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with a 
filler or binders, such as lactose or starches, lubricants, such as talc or magnesium 
20 stearate, and, optionally, stabilizers. In soft capsules, the active compounds can be 
dissolved or suspended in suitable liquids, such as fatty oils, liquid, or liquid 
polyethylene glycol with or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration can be formulated 
25 in aqueous solutions, preferably in physiologically compatible buffers such as 
Hanks' solution, Ringer's solution, or physiologically buffered saline. Aqueous 
injection suspensions can contain substances that increase the viscosity of the 
suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran." 
Additionally, suspensions of the active compounds can be prepared as appropriate 
30 oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils 
such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, 



WO 03/033688 



PCT/EP02/11473 



-52- 

or liposomes. Non-lipid polycationic amino polymers also can be used for delivery. 
Optionally, the suspension also can contain suitable stabilizers or agents that increase 
the solubility of the compounds to allow for the preparation of highly concentrated 
solutions. For topical or nasal administration, penetrants appropriate to the particular 
5 barrier to be permeated are used in the formulation. Such penetrants are generally 
known in the art. 

The pharmaceutical compositions of the present invention can be manufactured in a 
manner that is known in the art, e.g., by means of conventional mixing, dissolving, 

10 granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or 
lyophilizing processes. The pharmaceutical composition can be provided as a salt 
and can be formed with many acids, including but not limited to, hydrochloric, 
sulfuric, acetic, lactic, tartaric, malic, succinic, etc. Salts tend to be more soluble in 
aqueous or other protonic solvents than are the corresponding free base forms. In 

15 other cases, the preferred preparation can be a lyophilized powder which can contain 
any or all of the following: 1-50 mM histidine, 0.1%-2% sucrose, and 2-7% 
mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to use. ■ 

Further details on techniques for formulation and administration can be found in the 
20 latest edition of Remington's Pharmaceutical Sciences (Maack Publishing Co., 
Easton, Pa.). After pharmaceutical compositions have been prepared, they can be 
placed in an appropriate container and labeled for treatment of an indicated 
condition. Such labeling would include amount, frequency, and method of ad- 
ministration. 

25 

Therapeutic indications and methods 

Human receptor tyrosine phosphatase can be regulated to treat diabetes, CNS' 
disorders, COPD, and cardiovascular disorders. 

30 
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Diabetes mellitus is a common metabolic disorder characterized by an abnormal 
elevation in blood glucose, alterations in lipids and abnormalities (complications) in 
the cardiovascular system, eye, kidney and nervous system. Diabetes is divided into 
two separate diseases: type 1 diabetes (juvenile onset), which results from a loss of 
5 cells which make and secrete insulin, and type 2 diabetes (adult onset), which is 
caused by a defect in insulin secretion and a defect in insulin action. 

Type 1 diabetes is initiated by an autoimmune reaction that attacks the insulin 
secreting cells (beta cells) in the pancreatic islets. Agents that prevent this reaction 
10 from occurring or that stop the reaction before destruction of the beta cells has been 
accomplished are potential therapies for this disease. Other agents that induce beta 
cell proliferation and regeneration also are potential therapies. 

Type II diabetes is the most common of the two diabetic conditions (6% of the 
15 population). The defect in insulin secretion is an important cause of the diabetic 
condition and results from an inability of the beta cell to properly detect and respond 
to rises in blood glucose levels with insulin release. Therapies that increase the 
response by the beta cell to glucose would offer an important new treatment for this 
disease. 

20 

The defect in insulin action in Type II diabetic subjects is another target for 
therapeutic intervention. Agents that increase the activity of the insulin receptor in 
muscle, liver, and fat will cause a decrease in blood glucose and a normalization of 
plasma lipids. The receptor activity can be increased by agents that directly stimulate 
25 the receptor or that increase the intracellular signals from the receptor. Other thera- 
pies can directly activate the cellular end process, i.e. glucose transport or various 
enzyme systems, to generate an insulin-like effect and therefore a produce beneficial 
outcome. Because overweight subjects have a greater susceptibility to Type II' 
diabetes, any agent that reduces body weight is a possible therapy. 



30 
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Both Type I and Type diabetes can be treated with agents that mimic insulin action 
or that treat diabetic complications by reducing blood glucose levels. Likewise, 
agents that reduces new blood vessel growth can be used to treat the eye 
complications that develop in both diseases. 

5 

Central and peripheral nervous system disorders also can be treated, such as primary 
and secondary disorders after brain injury, disorders of mood, anxiety disorders, 
disorders of thought and volition, disorders of sleep and wakefulness, diseases of the 
motor unit, such as neurogenic and myopathic disorders, neurodegenerative disorders 
10 such as Alzheimer's and Parkinson's disease, and processes of peripheral and chronic 
pain. 

Pain that is associated with CNS disorders also can be treated by regulating the 
activity of human receptor tyrosine phosphatase. Pain which can be treated includes 

15 that associated with central nervous system disorders, such as multiple sclerosis, 
spinal cord injury, sciatica, failed back surgery syndrome, traumatic brain injury, 
epilepsy, Parkinson's disease, post-stroke, and vascular lesions in the brain and 
spinal cord (e.g., infarct, hemorrhage, vascular malformation). Non-central neuro- 
pathic pain includes that associated with post mastectomy pain, reflex sympathetic 

20 dystrophy (RSD), trigeminal neuralgiaradioculopathy, post-surgical pain, HIV/AIDS 
related pain, cancer pain, metabolic neuropathies (e.g., diabetic neuropathy, 
vasculitic neuropathy secondary to connective tissue disease), paraneoplastic 
polyneuropathy associated, for example, with carcinoma of lung, or leukemia, or 
lymphoma, or carcinoma of prostate, colon or stomach, trigeminal neuralgia, cranial 

25 neuralgias, and post-herpetic neuralgia. Pain associated with cancer and cancer treat- 
ment also can be treated, as can headache pain (for example, migraine with aura, 
migraine without aura, and other migraine disorders), episodic and chronic 
tension-type headache, tension-type like headache, cluster headache, and chronic' 
paroxysmal hemicrania. 



30 
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Central and peripheral nervous system disorders also can be treated, such as primary 
and secondary disorders after brain injury, disorders of mood, anxiety disorders, 
disorders of thought and volition, disorders of sleep and wakefulness, diseases of the 
motor unit, such as neurogenic and myopathic disorders, neurodegenerative disorders 
5 such as Alzheimer's and Parkinson's disease, and processes of peripheral and chronic 
pain. 

Pain that is associated with CNS disorders also can be treated by regulating the 
activity of human receptor tyrosine phosphatase. Pain which can be treated includes 

10 that associated with central nervous system disorders, such as multiple sclerosis, 
spinal cord injury, sciatica, failed back surgery syndrome, traumatic brain injury, 
epilepsy, Parkinson's disease, post-stroke, and vascular lesions in the brain and 
spinal cord (e.g., infarct, hemorrhage, vascular malformation). Non-central neuro- 
pathic pain includes that associated with post mastectomy pain, reflex sympathetic 

15 dystrophy (RSD), trigeminal neuralgiaradioculopathy, postsurgical pain, HIV/AIDS 
related pain, cancer pain, metabolic neuropathies (e.g., diabetic neuropathy, 
vasculitic neuropathy secondary to connective tissue disease), paraneoplastic 
polyneuropathy associated, for example, with carcinoma of lung, or leukemia, or 
lymphoma, or carcinoma of prostate, colon or stomach, trigeminal neuralgia, cranial 

20 neuralgias, and post-herpetic neuralgia. Pain associated with cancer and cancer 
treatment also can be treated, as can headache pain (for example, migraine with aura, 
migraine without aura, and other migraine disorders), episodic and chronic 
tension-type headache, tension-type like headache, cluster headache, and chronic 
paroxysmal hemicrania. 

25 

Cardiovascular diseases include the following disorders of the heart and the vascular 
system: congestive heart failure, myocardial infarction, ischemic diseases of the 
heart, all kinds of atrial and ventricular arrhythmias, hypertensive vascular diseases," 
and peripheral vascular diseases. 



30 
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Heart failure is defined as a pathophysiologic state in which an abnormality of 
cardiac function is responsible for the failure of the heart to pump blood at a rate 
commensurate with the requirement of the metabolizing tissue. It includes all forms 
of pumping failure, such as high-output and low-output, acute and chronic, 
5 right-sided or left-sided, systolic or diastolic, independent of the underlying cause. 

Myocardial infarction (MI) is generally caused by an abrupt decrease in coronary 
blood flow that follows a thrombotic occlusion of a coronary artery previously 
narrowed by arteriosclerosis. MI prophylaxis (primary and secondary prevention) is 
1 0 included, as well as the acute treatment of MI and the prevention of complications. 

Ischemic diseases are conditions in which the coronary flow is restricted resulting in 
a perfusion which inadequate to meet the myocardial requirement for oxygen. This 
group of diseases includes stable angina, unstable angina, and asymptomatic 
. 15 ischemia. 

Arrhythmias include all forms of atrial and ventricular tachyarrhythmias (atrial 
tachycardia, atrial flutter, atrial fibrillation, atrio-ventricular reentrant tachycardia, 
preexcitation syndrome, ventricular tachycardia, ventricular flutter, and ventricular 
20 fibrillation), as well as bradycardic forms of arrhythmias. 

Vascular diseases include primary as well as all kinds of secondary arterial 
hypertension (renal, endocrine, neurogenic, others). The disclosed gene and its 
product may be used as drug targets for the treatment of hypertension as well as for 
25 the prevention of all complications. 

Peripheral vascular diseases are defined as vascular diseases in which arterial and/or 
venous flow is reduced resulting in an imbalance between blood supply and tissue* 
oxygen demand. It includes chronic peripheral arterial occlusive disease (PAOD), 
30 acute arterial thrombosis and embolism, inflammatory vascular disorders, Raynaud's 
phenomenon, and venous disorders. 
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This invention farther pertains to the use of novel agents identified by the screening 
assays described above. Accordingly, it is within the scope of this invention to use a 
test compound identified as described herein in an appropriate animal model For 
5 example, an agent identified as described herein (e.g., a modulating agent, an anti- 
sense nucleic acid molecule, a specific antibody, ribozyme, or a human receptor 
tyrosine phosphatase polypeptide binding molecule) can be used in an animal model 
to determine the efficacy, toxicity, or side effects of treatment with such an agent. 
Alternatively, an agent identified as described herein can be used in an animal model 
10 to determine the mechanism of action of such an agent. Furthermore, this invention 
pertains to uses of novel agents identified by the above-described screening assays 
. for treatments as described herein. 

A reagent which affects receptor tyrosine phosphatase activity can be administered to 
15 a human cell, either in vitro or in vivo, to. reduce receptor tyrosine phosphatase 
activity. The reagent preferably binds to an expression product of a human receptor 
tyrosine phosphatase gene. If the expression product is a protein, the reagent is 
preferably an antibody. For treatment of human cells ex vivo, an antibody can be 
added to a preparation of stem cells that have been removed from the body. The cells 
20 can then be replaced in the same or another human body, with or without clonal 
propagation, as is known in the art. 

In one embodiment, the reagent is delivered using a liposome. Preferably, the lipo- 
some is stable in the animal into which it has been administered for at least about 30 

25 minutes, more preferably for at least about 1 hour, and even more preferably for at 
least about 24 hours. A liposome comprises a lipid composition that is capable of 
targeting a reagent, particularly a polynucleotide, to a particular site in an animal, 
such as a human. Preferably, the lipid composition of the liposome is capable of 
targeting to a specific organ of an animal, such as the lung, liver, spleen, heart brain, 

30 lymph nodes, and skin. 
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A liposome useful in the present invention comprises a lipid composition that is 
capable of fusing with the plasma membrane of the targeted cell to deliver its 
contents to the cell. Preferably, the transfection efficiency of a liposome is about 
0.5 fig of DNA per 16 nmole of liposome delivered to about 10 6 cells, more 
5 preferably about 1.0 fig of DNA per 16 nmole of liposome delivered to about 10 6 
cells, and even more preferably about 2.0 jig of DNA per 16 nmol of liposome de- 
livered to about 10 6 cells. Preferably, a liposome is between about 100 and 500 nm, 
more preferably between about 150 and 450 nm, and even more preferably between 
about 200 and 400 nm in diameter. 

10 

* . Suitable liposomes for use in the present invention include those liposomes 
standardly used in, for example, gene delivery methods known to those of skill in the 
art. More preferred liposomes include liposomes having a polycationic lipid 
composition and/or liposomes having a cholesterol backbone conjugated to polyeth- 
15 ylene glycol. Optionally, a liposome comprises a compound capable of targeting the 
liposome to a particular cell type, such as a cell-specific ligand exposed on the outer 
surface of the liposome. 

Complexing a liposome with a reagent such as an antisense oligonucleotide or 
• 20 ribozyme can be achieved using methods that are standard in the art (see, for 
example, U.S. Patent 5,705,151). Preferably, from about 0.1 jig to about 10 jig of 
polynucleotide is combined with about 8 nmol of liposomes, more preferably from, 
about 0.5 jig to about 5 \ig of polynucleotides are combined with about 8 nmol lipo- 
somes, and even more preferably about 1.0 |ig of polynucleotides is combined with 
25 about 8 nmol liposomes. 

In another embodiment, antibodies can be delivered to specific tissues in vivo using 
receptor-mediated targeted delivery. Receptor-mediated DNA delivery techniques 
are taught in, for example, Findeis et al Trends in Biotechnol 11, 202-05 (1993); 
30 Chiou et al, GENE THERAPEUTICS : METHODS AND APPLICATIONS OF DIRECT GENE 

Transfer (J.A. Wolff, ed.) (1994); Wu & Wu, 1 Biol Chem. 263, 621-24 (1988); 
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Wu et ah, 1 Biol. Chem. 269, 542-46 (1994); Zenke et ah, Proc. Natl Acad, Sci 
USA. 87, 3655-59 (1990); Wu et al, J. Biol Chem, 266, 338-42 (1991). 

Determination of a therapeutically effective dose 

5 

The determination of a therapeutically effective dose is well within the capability of 
those skilled in the art. A therapeutically effective dose refers to that amount of 
active ingredient which increases or decreases enzymatic activity relative to the 
enzymatic activity which occurs in the absence of the therapeutically effective dose. 

10 

For any compound, the therapeutically effective dose can be estimated initially either 
in cell culture assays or in animal models, usually mice, rabbits, dogs, or pigs. The 
animal model also can be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses 
1 5 and routes for administration in humans. 

Therapeutic efficacy and toxicity, e.g., ED 5 o (the dose therapeutically effective in 
50% of the population) and LD 50 (the dose lethal to 50% of the population), can be 
determined by standard pharmaceutical procedures in cell cultures or experimental 
20 animals. The dose ratio of toxic to therapeutic effects is the therapeutic index, and it 
can be expressed as the ratio, LD50/ED50. 

Pharmaceutical compositions that exhibit large therapeutic indices are preferred. The 
data obtained from cell culture assays and animal studies is used in formulating a 
25 range of dosage for human use. The dosage contained in such compositions is 
preferably within a range of circulating concentrations that include the ED 50 with 
little or no toxicity. The dosage varies within this range depending upon the dosage 
form employed, sensitivity of the patient, and the route of administration. 

30 ( The exact dosage will be determined by the practitioner, in light of factors related to 
the subject that requires treatment. Dosage and administration are adjusted to 
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provide sufficient levels of the active ingredient or to maintain the desired effect. 
Factors that can be taken into account include the severity of the disease state, 
general health of the subject, age, weight, and gender of the subject, diet, time and 
frequency of administration, drug combination^), reaction sensitivities, and 
tolerance/response to therapy. Long-acting pharmaceutical compositions can be 
administered every 3 to 4 days, every week, or once every two weeks depending on 
the half-life and clearance rate of the particular formulation. 

Normal dosage amounts can vary from 0.1 to 100,000 micrograms, up to a total dose 
of about 1 g, depending upon the route of administration. Guidance as to particular 
dosages and methods of delivery is provided in the literature and generally available 
to practitioners in the art. Those skilled in the art will employ different formulations 
for nucleotides than for proteins or their inhibitors. Similarly, delivery of poly- 
nucleotides or polypeptides will be specific to particular cells, conditions, locations, 
etc. 

If the reagent is a single-chain antibody, polynucleotides encoding the antibody can 
be constructed and introduced into a cell either ex vivo or in vivo using well- 
established techniques including, but not limited to, transferrin-polycation-mediated 
DNA transfer, transfection with naked or encapsulated nucleic acids, liposome- 
mediated cellular fusion, intracellular transportation of DNA-coated latex beads, 
protoplast fusion, viral infection, electroporation, "gene gun," and DEAE- or calcium 
phosphate-mediated transfection. 

Effective in vivo dosages of an antibody are in the range of about 5 \ig to about 
50 yg/kg, about 50 jig to about 5 mg/kg, about 100 jxg to about 500 |ig/kg of patient 
body weight, and about 200 to about 250 jig/kg of patient body ,weight. For 
administration of polynucleotides encoding single-chain antibodies, effective in vivo 
dosages are in the range of about 100 ng to about 200 ng, 500 ng to about 50 mg, 
about 1 jig to about 2 mg, about 5 jag to about 500 y,g, and about 20 \ig to about 
100|igofDNA. 
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If the expression product is mRNA, the reagent is preferably an antisense oligo- 
nucleotide or a ribozyme. Polynucleotides that express antisense oligonucleotides or 
ribozymes can be introduced into cells by a variety of methods, as described above. 

5 

Preferably, a reagent reduces expression of a human receptor tyrosine phosphatase 
gene or the activity of a receptor tyrosine phosphatase polypeptide by at least about 
10, preferably about 50, more preferably about 75, 90, or 100% relative to the 
absence of the reagent. The effectiveness of the mechanism chosen to decrease the 

1 0 level of expression of a human receptor tyrosine phosphatase gene or the activity of a 
human receptor tyrosine phosphatase polypeptide can be assessed using methods 
well known in the art, such as hybridization of nucleotide probes to receptor tyrosine 
phosphatase-specific mRNA, quantitative RT-PCR, immunologic detection of a 
human receptor tyrosine phosphatase polypeptide, or measurement of enzymatic 

15 activity. 

In any of the embodiments described above, any of the pharmaceutical compositions 
of the invention can be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination 
20 therapy can be made by one of ordinary skill in the art, according to conventional 
pharmaceutical principles. The combination of therapeutic agents can act syner- 
gistically to effect the treatment or prevention of the various disorders described 
above. Using this approach, one may be able to achieve therapeutic efficacy with 
lower dosages of each agent, thus reducing the potential for adverse side effects. 

25 ■ 

Any of the therapeutic methods described above can be applied to any subject in 
need of such therapy, including, for example, mammals such as dogs, cats, cows, 
horses, rabbits, monkeys, and most preferably, humans. 
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Diagnostic methods 

Human receptor tyrosine phosphatase also can be used in diagnostic assays for 
detecting diseases and abnormalities or susceptibility to diseases and abnormalities 
5 , related to the presence of mutations in the nucleic acid sequences that encode the 
enzyme. For example, differences can be determined between the cDNA or 
genomic sequence encoding receptor tyrosine phosphatase in individuals afflicted 
with a disease and in normal individuals. If a mutation is observed in some or all of 
the afflicted individuals but not in normal individuals, then the mutation is likely to 
10 be the causative agent of the disease. 

Sequence differences between a reference gene and a gene having mutations can be 
revealed by the direct DNA sequencing method. In addition, cloned DNA segments 
can be employed as probes to detect specific DNA segments. The sensitivity of this 
15 method is greatly enhanced when combined with PCR. For example, a sequencing 
primer can be used with a double-stranded PCR product or a single-stranded template 
molecule generated by a modified PCR. The sequence determination is performed 
by conventional procedures using radiolabeled nucleotides or by automatic 
sequencing procedures using fluorescent tags. 

20 

Genetic testing based on DNA sequence differences can be carried out by detection 
of alteration in electrophoretic mobility of DNA fragments in gels with or without 
denaturing agents. Small sequence deletions and insertions can be visualized, for 
example, by high resolution gel electrophoresis. DNA fragments of different 

25 sequences can be distinguished on denaturing formamide gradient gels in which the 
mobilities of different DNA fragments are retarded in the gel at different positions 
according to their specific melting or partial melting temperatures {see, e.g., Myers et 
al., Science 230, 1242, 1985). Sequence changes at specific locations can also be' 
revealed by nuclease protection assays, such as RNase and S 1 protection or the 

30 chemical cleavage method {e.g., Cotton et al, Proc. Natl Acad. Set USA 85, 
4397-4401, 1985). Thus, the detection of a specific DNA sequence can be performed 
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by methods such as hybridization, RNase protection, chemical cleavage, direct DNA 
sequencing or the use of restriction enzymes and Southern blotting of genomic DNA. 
In addition to direct methods such as gel-electrophoresis and DNA sequencing, 
mutations can also be detected by in situ analysis. 

5 

Altered levels of receptor tyrosine phosphatase also can be detected in various 
tissues. Assays used to detect levels of the receptor polypeptides in a body sample, 
such as blood or a tissue biopsy, derived from a host are well known to those of skill 
in the art and include radioimmunoassays, competitive binding assays, Western blot 
10 analysis, and ELISA assays. 

All patents and patent applications cited in this disclosure are expressly incorporated 
herein by reference. The above disclosure generally describes the present invention. 
A more complete understanding can be obtained by reference to the following 
15 specific examples, which are provided for purposes of illustration only and are not 
intended to limit the scope of the invention. 

EXAMPLE 1 

Detection of receptor tyrosine phosphatase activity 

20 

The polynucleotide of SEQ ID NO: 1 is inserted into the expression vector pCEV4 
and the expression vector pCEV4-receptor tyrosine phosphatase polypeptide 
obtained is transfected into human embryonic kidney 293 cells. To prepare mem- 
brane fractions, the cells are sonicated in a hypotonic lysis buffer (25 mM Tris-HCl, 

25 pH 7.5/25 mM sucrose/0.1 mM EDTA/5 mM MgC12/5 mM DTT/1 mM 
phenylmethylsulfonyl fluoride/0.5 jxg/ml leupeptin/1 p.g/ml aprotinin), nuclei are 
removed by low-speed centrifugation, and membrane fractions are obtained by 
centrifugation at 100,000 x g for 60 min at 4°C. The resulting pellets are suspended' 
by sonication in lysis buffer, brought to a concentration of 2 mg/ml, and used to 

30 measure receptor tyrosine phosphatase activity.. A nonradioactive protein-tyrosine 
phosphatase assay kit (Roce Molecular Biochemicals) is used according to the 
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manufacturer's instructions. Signals on Western blots are detected by chemi- 
luminescence (ECL, Amersham Pharmacia Biotech. It is shown that the polypeptide 
of SEQ ID NO: 2 has a receptor tyrosine phosphatase activity. 

5 EXAMPLE 2 

Expression of recombinant human receptor tyrosine phosphatase 

The Pichia pastoris expression vector pPICZB (Invitrogen, San Diego, CA) is used 
to produce large quantities of recombinant human receptor tyrosine phosphatase 

10 polypeptides in yeast. The receptor tyrosine phosphatase-encoding DNA sequence is 
derived from SEQ ID NO: 1 . Before insertion into vector pPICZB, the DNA sequence 
is modified by well known methods in such a way that it contains at its 5'-end an 
initiation codon and at its 3' -end an enterokinase cleavage site, a His6 reporter tag 
and a termination codon. Moreover, at both termini recognition sequences for 

15 restriction endonucleases are added and after digestion of the multiple cloning site of 
pPICZ B with the corresponding restriction enzymes the modified DNA sequence is 
ligated into pPICZB. This expression vector is designed for inducible expression in 
Pichia pastoris, driven by a yeast promoter. The resulting pPICZ/md-His6 vector is 
used to transform the yeast. 

20 

The yeast is cultivated under usual conditions in 5 liter shake flasks and the 
recombinantly produced protein isolated from the culture by affinity chromatography 
(Ni-NTA-Resin) in the presence of 8 M urea. The bound polypeptide is eluted with 
buffer, pH 3.5, and neutralized. Separation of the polypeptide from the His6 reporter 
25 tag is accomplished by site-specific proteolysis using enterokinase (Invitrogen, San 
Diego, CA) according to manufacturer's instructions. Purified human receptor 
tyrosine phosphatase polypeptide is obtained. 
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EXAMPLE3 

Identification of test compounds that bind to receptor tyrosine phosphatase 
polypeptides 

5 Purified receptor tyrosine phosphatase polypeptides comprising a glutathione-S- 
transferase protein and absorbed onto glutathione-derivatized wells of 96-well micro- 
liter plates are contacted with test compounds from a small molecule library at pH 
7,0 in a physiological- buffer solution. Human receptor tyrosine phosphatase 
polypeptides comprise the amino acid sequence shown in SEQ ID NO:2. The test 
10 compounds comprise a fluorescent tag.. The samples are incubated for 5 minutes to 
one hour. Control samples are incubated in the absence of a test compound. 

' The buffer solution containing the test compounds is washed from the wells. 
Binding of a test compound to a human receptor tyrosine phosphatase polypeptide is 
15 detected by fluorescence measurements of the contents of the wells. A test 
compound that increases the fluorescence in a well by at least 15% relative to 
fluorescence of a well in which a test compound is not incubated is identified as a 
compound which binds to a human receptor tyrosine phosphatase polypeptide. 

20 EXAMPLE 4 

Identification of a test compound which decreases receptor tyrosine phosphatase 
gene expression 

A test compound is administered to a culture of human cells transfected with a 
25 receptor tyrosine phosphatase expression construct and incubated at 37 °C for 10 to 
45 minutes. A culture of the same type of cells that have not been transfected is 
incubated for the same time without the test compound to provide a negative control. 
RNA is isolated from the two cultures as described in Chirgwin et al, Biochem. 18,' 
5294-99, 1979). Northern blots are prepared using 20 to 30 \xg total RNA and 
30 hybridized with a 32 P-labeled receptor tyrosine phosphatase-specific probe at 65 0 C 
in Express-hyb (CLONTECH). The probe comprises at least 11 contiguous 
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nucleotides selected from the complement of SEQ ID NO:l. A test compound that 
decreases the receptor tyrosine phosphatase-specific signal relative to the signal 
obtained in the absence of the test compound is identified as an inhibitor of receptor 
tyrosine phosphatase gene expression, 

5 

EXAMPLE 5 

Identification of a test compound which decreases receptor tyrosine phosphatase 
activity 

10 A test compound is administered to a culture of human cells transfected with a 
receptor tyrosine phosphatase expression construct and incubated at 37 °C for .10 to 
45 minutes. A culture of the same type of cells that have not been transfected is 
incubated for the same time without the test compound to provide a negative control. 
Enzymatic activity is measured using the method of Calvert-Evers & Hammond, Cell 

15 Biol Int. 24, 559-68, 2000. 

A test compound which decreases the enzymatic activity of the receptor tyrosine 
phosphatase relative to the enzymatic activity in the absence of the test compound is 
identified as an inhibitor of receptor tyrosine phosphatase activity. 

20 

EXAMPLE 6 

Tissue-specific expression of receptor tyrosine phosphatase 

The qualitative expression pattern of receptor tyrosine phosphatase in various tissues 
25 is determined by Reverse Transcription-Polymerase Chain Reaction (RT-PCR). 

Quantitative expression profiling. 

To demonstrate that receptor tyrosine phosphatase is involved in the disease process 
30 of diabetes, the following whole body panel is screened to show predominant or 
relatively high expression: subcutaneous and mesenteric adipose tissue, adrenal 
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gland, bone marrow, brain, colon, fetal brain, heart, hypothalamus, kidney, liver, 
lung, mammary gland, pancreas, placenta, prostate, salivary gland, skeletal muscle, 
small intestine, spleen, stomach, testis, thymus, thyroid, trachea, and uterus. Human 
islet cells and an islet cell library also are tested. As a final step, the expression of 
5 receptor tyrosine phosphatase in cells derived from normal individuals with the 
expression of cells derived from diabetic individuals is compared. 

To demonstrate that receptor tyrosine phosphatase is involved in CNS disorders, the 
following tissues are screened: fetal and adult brain, muscle, heart, lung, kidney, 
10 liver, thymus, testis, colon, placenta, trachea, pancreas, kidney, gastric mucosa, 
colon, liver, cerebellum, skin, cortex (Alzheimer's and normal), hypothalamus, 
cortex, amygdala, cerebellum, hippocampus, choroid, plexus, thalamus, and spinal 
cord. 

15 To demonstrate that receptor tyrosine phosphatase is involved in the disease process 
of COPD, the initial expression panel consists of RNA samples from respiratory 
tissues and inflammatory cells relevant to COPD: lung (adult and fetal), trachea, 
freshly isolated alveolar type II cells, cultured human bronchial epithelial cells, 
cultured small airway epithelial cells, cultured bronchial sooth muscle cells, cultured 

20 H441 cells (Clara-like), freshly isolated neutrophils and monocytes, and cultured 
monocytes (macrophage-like). Body map profiling also is carried out, using total 
RNA panels purchased from Clontech. The tissues are adrenal gland, bone marrow, 
brain, colon, heart, kidney, liver, lung, mammary gland, pancreas, prostate, salivary 
gland, skeletal muscle, small intestine, spleen, stomach, testis, thymus, trachea, 

25 thyroid, and uterus. 

Quantitative expression profiling is performed by the form of quantitative PGR 
analysis called "kinetic analysis" firstly described in Higuchi et al 9 BioTechnology' 
10, 413-17, 1992; and Higuchi et al, BioTechnology 11, 1026-30, 1993. The 
30 principle is that at any given cycle within the exponential phase of PCR, the amount 
of product is proportional to the initial number of template copies. 
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If the amplification is performed in the presence of an internally quenched 
fluorescent oligonucleotide (TaqMan probe) complementary to. the target sequence, 
the probe is cleaved by the 5 '-3' endonuclease activity of Taq DNA polymerase and 
5 a fluorescent dye released in the medium (Holland et aL, Proc. Natl Acad. Set 
U.S.A. 88, 7276-80, 1991). Because the fluorescence emission will increase in direct 
proportion to the amount of the specific amplified product, the exponential growth 
phase of PCR product can be detected and used to determine the initial template 
concentration (Heid et al. 9 Genome Res. 6, 986-94, 1996, and Gibson et al y Genome 
10 Res. <5, 995-1001, 1996). 

The amplification of an endogenous control can be performed to standardize the 
amount of sample RNA added to a reaction. In this kind of experiment, the control 
of choice is the 18S ribosomal RNA. Because reporter dyes with differing emission 
15 spectra are available, the target and the endogenous control can be independently 
quantified in the same tube if probes labeled with different dyes are used. 

All "real time PCR" measurements of fluorescence are made in the ABI Prism 7700. 

20 RNA extraction and cDNA preparation. Total RNA from the tissues listed above are 
used for expression quantification. RNAs labeled "from autopsy" were extracted 
from autoptic tissues with the TRIzol reagent (Life Technologies, MD) according to 
the manufacturer's protocol. 

25 Fifty (xg of each RNA were treated with DNase I for 1 hour at 37°C in the following 
reaction mix: 0.2 U/pl RNase-free DNase I (Roche Diagnostics, Germany); 0.4 U/pl 
RNase inhibitor (PE Applied Biosystems, CA); 10 mM Tris-HCl pH 7.9; lOmM 
MgCl 2 ; 50 mM NaCl; and 1 rnM DTT. 
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After incubation, RNA is extracted once with 1 volume of phenol:chloroform:iso- 
ainyl alcohol (24:24:1) and once with chloroform, and precipitated with 1/10 volume 
of 3 M sodium acetate, pH5.2, and 2 volumes of ethanol. 

5 Fifty jig of each RNA from the autoptic tissues are DNase treated with the DNA-free 
kit purchased from Ambion (Ambion, TX). After resuspension and spectrophoto- 
metric quantification, each sample is reverse transcribed with the TaqMan Reverse 
Transcription Reagents (PE Applied Biosystems, CA) according to the manu- 
facturer's protocol The final concentration of RNA in the reaction mix is 200ng/jiL. 
10 Reverse transcription is carried out with 2.5pM of random hexamer primers. 

TaqMan quantitative analysis. Specific primers and probe are designed according to 
the recommendations of PE Applied Biosystems; the probe can be labeled at the 5' 
end FAM (6-carboxy-fluorescein) and at the 3' end with TAMRA (6-carboxy-tetra- 
15 methyl-rhodamine). Quantification experiments are performed on 10 ng of reverse 
transcribed RNA from each sample. Each determination is done in triplicate. 

Total cDNA content is normalized with the simultaneous quantification (multiplex 
PCR) of the 18S ribosomal RNA using the Pre-Developed TaqMan Assay Reagents 
20 (PDAR) Control Kit (PE Applied Biosystems, CA). 

The assay reaction mix is as follows: IX final TaqMan Universal PCR Master Mix 
(from 2X stock) (PE Applied Biosystems, CA); IX PDAR control - 18S RNA (from 
20X stock); 300 nM forward primer; 900 nM reverse primer; 200 nM probe; 10 ng 
25 cDNA; and water to 25 

Each of the following steps are carried out once: pre PCR, 2 minutes at 50° C, and 
10 minutes at 95°C. The following steps are carried out 40 times: denaturation, 15 
seconds at 95°C, annealing/extension, 1 minute at 60°C. 
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The experiment is performed on an ABI Prism 7700 Sequence Detector (PE Applied 
Biosystems, CA). At the end of the run, fluorescence data acquired during PCR are 
processed as described in the AJBI Prism 7700 user's manual in order to achieve 
better background subtraction as well as signal linearity with the starting target 
5 quantity. 

EXAMPLE 7 

Diabetes: In vivo testing of compounds/target validation 
Glucose Production 

Over-production of glucose by the liver, due to an enhanced rate of gluconeogenesis, 
is the major cause of fasting hyperglycemia in diabetes. Overnight fasted normal rats 
15 or mice have elevated rates of gluconeogenesis as do streptozotocin-induced diabetic 
rats or mice fed ad libitum. Rats are made diabetic with a single intravenous in- 
jection of 40 mg/kg of streptozotocin while C57BL/KsJ mice are given 40-60 mg/kg 
i.p. for 5 consecutive days. Blood glucose is measured from tail-tip blood and then 
compounds are administered via different routes (p.o., i.p., i.v., s.c). Blood is 
collected at various times thereafter and glucose measured. Alternatively, 
compounds are administered for several days, then the animals are fasted overnight, 
blood is collected and plasma glucose measured. Compounds that inhibit glucose 
production will decrease plasma glucose levels compared to the vehicle-treated 
control group. 

Insulin Sensitivity 

Both ob/ob and db/db mice as well as diabetic Zucker rats are hyperglycemic,* 
hyperinsulinemic and insulin resistant. The animals are pre-bled, their glucose levels 
measured, and then they are grouped so that the mean glucose level is the same for 
each group. Compounds are administered daily either q.d. or b.i.d. by different 
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routes (p.o., i.p., s.c.) for 7-28 days. Blood is collected at various times and plasma 
glucose and insulin levels determined. Compounds that improve insulin sensitivity in 
these models will decrease both plasma glucose and insulin levels when compared to 
the vehicle-treated control group. 

5 

Insulin Secretion 

Compounds that enhance insulin secretion from the pancreas will increase plasma 
insulin levels and improve the disappearance of plasma glucose following the 

10 administration of a glucose load. When measuring insulin levels, compounds are 
administered by different routes (p.o., i.p., s.c. or i.v.) to overnight fasted normal rats 
or mice. At the appropriate time an intravenous glucose load (0.4g/kg) is given, 
blood is collected one minute later. Plasma insulin levels are determined. 
Compounds that enhance insulin secretion will increase plasma insulin levels 

15 compared to animals given only glucose. When mieasuring glucose disappearance, 
animals are bled at the appropriate time after compound administration, then given 
either an oral or intraperitoneal glucose load (lg/kg), bled again after 15, 30, 60 and . 
90 minutes and plasma glucose levels determined. Compounds that increase insulin 
levels will decrease glucose levels and the area-under-the glucose curve when 

20 compared to the vehicle-treated group given only glucose. 

Compounds that enhance insulin secretion from the pancreas will increase plasma 
insulin levels and improve the disappearance of plasma glucose following the ad- 
ministration of a glucose load. When measuring insulin levels, test compounds 

25 which regulate receptor tyrosine phosphatase are administered by different routes 
(p.o., i.p., s.c, or i.v.) to overnight fasted normal rats or mice. At the appropriate 
time an intravenous glucose load (0.4g/kg) is given, blood is collected one minute 
later. Plasma insulin levels are determined. Test compounds that enhance insulin' 
secretion will increase plasma insulin levels compared to animals given only glucose. 

30 When measuring glucose disappearance, animals are bled at the appropriate time 
after compound administration, then given either an oral or intraperitoneal glucose 
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load (lg/kg), bled again after 15, 30, 60, and 90 minutes and plasma glucose levels 
determined. Test compounds that increase insulin levels will decrease glucose levels 
and the area-under-the glucose curve when compared to the vehicle-treated group 
given only glucose. 

5 

Glucose Production 

Over-production of glucose by the liver, due to an enhanced rate of gluconeogenesis, 
is the major cause of fasting hyperglycemia in diabetes. Overnight fasted normal rats 

10 or mice have elevated rates of gluconeogenesis as do streptozotocin-induced diabetic 
rats or mice fed ad libitum. Rats are made diabetic with a single intravenous 
injection of 40 mg/kg of streptozotocin while C57BL/KsJ mice are given 40- 
60 mg/kg i.p. for 5 consecutive days. Blood glucose is measured from tail-tip blood 
and then compounds are administered via different routes (p.o., i.p., i.v., s.c). Blood 

15 is collected at various times thereafter and glucose measured. Alternatively, 
compounds are administered for several days, then the animals are fasted overnight, 
blood is collected and plasma glucose measured. Compounds that inhibit glucose 
production will decrease plasma glucose levels compared to the vehicle-treated 
control group. 

20 

Insulin Sensitivity 

Both ob/ob and db/db mice as well as diabetic Zucker rats are hyperglycemic, 
hyperinsulinemic and insulin resistant. The animals are pre-bled, their glucose levels 

25 measured, and then they are grouped so that the mean glucose level is the same for 
each group. Compounds are administered daily either q.d. or b.i.d by different 
routes (p.o., i.p., s.c.) for 7-28 days. Blood is collected at various times and plasma 
glucose and insulin levels determined. Compounds that improve insulin sensitivity in' 
these models will decrease both plasma glucose and insulin levels when compared to 

30 the vehicle-treated control group. 
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Insulin Secretion 

Compounds that enhance insulin secretion from the pancreas will increase plasma 
insulin levels and improve the disappearance of plasma glucose following the 
5 administration of a glucose load.. When measuring insulin levels, compounds are 
administered by different routes (p.o., i.p., s.c. or i.v.) to overnight fasted normal rats 
or mice. At the appropriate time an intravenous glucose load (0.4g/kg) is given, 
blood is collected one minute later. Plasma insulin levels are determined. 
Compounds that enhance insulin secretion will increase plasma insulin levels 

10 compared to animals given only glucose. When measuring glucose disappearance, 
animals are bled at the appropriate time after compound administration, then given 
either an oral or intraperitoneal glucose load (lg/kg), bled again after 15, 30, 60 and 
90 minutes and plasma glucose levels determined. Compounds that increase insulin 
levels will decrease glucose levels and the area-under-the glucose curve when 

15 compared to the vehicle-treated group given only glucose. 

EXAMPLE 8 

In vivo testing of compounds/target validation 

20 

Pain 

Acute pain. Acute pain is measured on a hot plate mainly in rats. Two variants of hot 
plate -testing are used: In the classical variant animals are put on a hot surface (52 to 

25 56 °C) and the latency time is measured until the animals show nocifensive behavior, 
such as stepping or foot licking. The other variant is an increasing temperature hot 
plate where the experimental animals are put on a surface of neutral temperature. 
Subsequently this surface is slowly but constantly heated until the animals begin to* 
lick a hind paw. The temperature which is reached when hind paw licking begins is a 

30 measure for pain threshold. 
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Compounds are tested against a vehicle treated control group. Substance application 
is performed at different time points via different application routes (i.v:, i.p., p.o., 
i.t., i.c.v., s.c, intradermal, transdermal) prior to pain testing. 

5 Persistent pain. Persistent pain is measured with the formalin or capsaicin test, 
mainly in rats. A solution of 1 to 5% formalin or 10 to 100 \ig capsaicin is injected 
into one hind paw of the experimental animal. After formalin or capsaicin application 
the animals show nocifensive reactions like flinching, licking and biting of the 
affected paw. The number of nocifensive reactions within a time frame of up to 
10 90 minutes is a measure for intensity of pain. ^ 

Compounds are tested against a vehicle treated control group. Substance application 
is performed at different time points via different application routes (i.v., i.p., p.o., 
i.t., i.e. v., s.c., intradermal, transdermal) prior to formalin or capsaicin administration. 

15 

Neuropathic, pain. Neuropathic pain is induced by different variants of unilateral 
sciatic nerve injury mainly in rats. The operation is performed under anesthesia. The 
first variant of sciatic nerve injury is produced by placing loosely constrictive 
ligatures around the common sciatic nerve. The second variant is the tight ligation of 

20 about the half of the diameter of the common sciatic nerve. In the next variant, a 
group of models is used in which tight ligations or transections are made of either the 
L5 and L6 spinal nerves, or the L% spinal nerve only. The fourth variant involves an 
axotomy of two of the three terminal branches of the sciatic nerve (tibial and 
common peroneal nerves) leaving the remaining sural nerve intact whereas the last 

25 variant comprises the axotomy of only the tibial branch leaving the sural and 
common nerves uninjured. Control animals are treated with a sham operation. 

Postoperatively, the nerve injured animals develop a chronic mechanical allodynia,' 
cold allodynioa, as well as a thermal hyperalgesia. Mechanical allodynia is measured 
30 by means of a pressure transducer (electronic von Frey Anesthesiometer, IITC 
Inc.-Life Science Instruments, Woodland Hills, SA, USA; Electronic von Frey 
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System, Somedic Sales AB, Horby, Sweden). Thermal hyperalgesia is measured by 
means of a radiant .heat source (Plantar Test, Ugo Basile, Comerio, Italy), or by 
means of a cold plate of 5 to 10 °C where the nocifensive reactions of the affected 
hind paw are counted as a measure of pain intensity. A further test for cold induced 
5 pain is the counting of nocifensive reactions, or duration of nocifensive responses 
after plantar administration of acetone to the affected hind limb. Chronic pain in 
general is assessed by registering the circadanian rhythms in activity (Surjo and 
Arndt, Universitat zu Koln, Cologne, Germany), and by scoring differences in gait 
(foot print patterns; FOOTPRINTS program, Klapdor et al., 1997. A low cost method 
10 to analyze footprint patterns. J. Neurosci. Methods 75, 49-54). 

Compounds are tested against sham operated and vehicle treated control groups. 
Substance application is performed at different time points via different application 
routes (i.v., i.p., p.o., i.t., i.c.v., s.c, intradermal, transdermal) prior to pain testing. 

15 

Inflammatory Pain Inflammatory pain is induced mainly in rats by injection of 
0.75 mg carrageenan or complete Freund's adjuvant into one hind paw. The animals 
develop an edema with mechanical allodynia as well as thermal hyperalgesia. 
Mechanical allodynia is measured by means of a pressure transducer (electronic von 

20 Frey Anesthesiometer, ETC Inc.-Life Science Instruments, Woodland Hills, SA, 
USA). Thermal hyperalgesia is measured by means of a radiant heat source (Plantar 
Test, Ugo Basile, Comerio, Italy, Paw thermal stimulator, G. Ozaki, University of 
California, USA). For edema measurement two methods are being used. In the first 
method, the animals are sacrificed and the affected hindpaws sectioned and weighed. 

25 The second method comprises differences in paw volume by measuring water 
displacement in a plethysmometer (Ugo Basile, Comerio, Italy), 



30 



Compounds are tested against uninflamed as well as vehicle treated control groups.' 
Substance application is performed at different time points via different application 
routes (i.v., i.p., p.o., i.t., i.c.v., s.c, intradermal, transdermal) prior to pain testing. 
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Diabetic neuropathic pain. Rats treated with a single intraperitoneal injection of 50 
to 80 mg/kg streptozotocin develop a profound hyperglycemia and mechanical 
allodynia within 1 to 3 weeks. Mechanical allodynia is measured by means of a 
pressure transducer (electronic von Frey Anesthesiometer, ETC Inc.-Life Science 
5 Instruments, Woodland Hills, SA, USA). 

Compounds are tested against diabetic and non-diabetic vehicle treated control 
groups. Substance application is performed at different time points via different 
application routes (i.v., i.p., p.o., i.t, i.c.v., sx., intradermal, transdermal) prior to 
10 pain testing. 

Parkinson's disease 

6-Hydroxydopamine (6-OH-DA) Lesion, 

15 

Degeneration of the dopaminergic nigrostriatal and striatopallidal pathways is the 
central pathological event in Parkinson's disease. This disorder has been mimicked 
experimentally in rats using single/sequential unilateral stereotaxic injections of 
6-OH-DA into the medium forebrain bundle (MFB). 

20 

Male Wistar rats (Harlan Winkelmann, Germany), weighing 200±250 g at the be- 
ginning of the experiment, are used. The rats are maintained in a temperature- and 
humidity-controlled environment under a 12 h light/dark cycle with free access to 
food and water when not in experimental sessions. The following in vivo protocols 
25 are approved by the governmental authorities. All efforts are made to minimize 
animal suffering, to reduce the number of animals used, and to utilize alternatives to 
in vivo techniques. 

Animals are administered pargyline on the day of surgery (Sigma, St. Louis, MO, 
30 USA; 50 mg/kg i.p.) in order to inhibit metabolism of 6-OHDA by monoamine 
oxidase and desmethylimipramine HC1 (Sigma; 25 mg/kg i.p.) in order to prevent 
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uptake of 6-OHDA by noradrenergic terminals. Thirty minutes later the rats are 
anesthetized with sodium pentobarbital (50 mg/kg) and placed in a stereotaxic frame. 
In order to lesion the DA nigrostriatal pathway 4 |al of 0.01% ascorbic acid-saline 
containing 8 jag of 6-OHDA HBr (Sigma) are injected into the left medial fore-brain 
5 bundle at a rate of 1 \iVmm (2.4 mm anterior, 1.49 mm lateral, -2.7 mm ventral to 
Bregma and the skull surface). The needle is left in place an additional 5 min to allow 
diffusion to occur. 

Stepping Test. Forelimb akinesia is assessed three weeks following lesion placement 
10 using a modified stepping test protocol. In brief, the animals are held by the 
experimenter with one hand fixing the hindlimbs and slightly raising the hind part 
above the surface. One paw is touching the table, and is then moved slowly sideways 
(5 s for 1 m), first in the forehand and then in the backhand direction. The number of 
adjusting steps is counted for both paws in the backhand and forehand direction of 
15 movement. The sequence of testing is right paw forehand and backhand adjusting 
stepping, followed by left paw forehand and backhand directions. The test is repeated 
three times on three consecutive days, after an initial training period of three days 
prior to the first testing. Forehand adjusted stepping reveals no consistent differences 
between lesioned and healthy control animals. Analysis is therefore restricted to 
20 backhand adjusted stepping. 

Balance Test Balance adjustments following postural challenge are also measured 
during the stepping test sessions. The rats are held in the same position as described 
in the stepping test and, instead of being moved sideways, tilted by the experimenter 

25 towards the side of the paw touching the table. This maneuver results in loss of 
balance and the ability of the rats to regain balance by forelimb movements is scored 
on a scale ranging from 0 to 3. Score 0 is given for a normal forelimb placement. 
When the forelimb movement is delayed but recovery of postural balance detected, 
score 1 is given. Score 2 represents a clear, yet insufficient, forelimb reaction, as 

30 evidenced by muscle contraction, but lack of success in recovering balance, and score 
3 is given for no reaction of movement. The test is repeated three times a day on each 
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side for three consecutive days after an initial training period of three days prior to 
the first testing. 

Staircase Test (Paw Reaching). A modified version of the staircase test is used for 
5 evaluation of paw reaching behavior three weeks following primary and secondary 
lesion placement. Plexiglass test boxes with a central platform and a removable 
staircase on each side are used. The apparatus is designed such that only the paw on 
the same side at each staircase can be used, thus providing a measure of independent 
• forelimb use. For each test the animals are left in the test boxes for 15 min. The 

10 double staircase is filled with 7x3 chow pellets (Precision food pellets, formula: P, 
purified rodent diet, size 45 mg; Sandown Scientific) on each side. After each test the 
number of pellets eaten (successfully retrieved pellets) and the number of pellets 
taken (touched but dropped) for each paw and the success rate (pellets eaten/pellets 
taken) are counted separately. After three days of food deprivation (12 g per animal 

15 per day) the animals are tested for 11 days. Full analysis is conducted only for the 
last five days. 

MPTP treatment. The neurotoxin l-methyl-4-phenyl-l,2,3,6- tetrahydro-pyridine 
(MPTP) causes degeneration of mesencephalic dopaminergic (DAergic) neurons in 
20 rodents, non-human primates, and humans and, in so doing, reproduces many of the 
symptoms of Parkinson's disease. MPTP leads to a marked decrease in the levels of 
dopamine and its metabolites, and in the number of dopaminergic terminals in the 
striatum as well as severe loss of the tyrosine hydroxylase (TH)-immunoreactive cell 
bodies in the substantia nigra, pars compacta. 

25 

In order to obtain severe and long-lasting lesions, and to reduce mortality, animals 
receive single injections of MPTP, and are then tested for severity of lesion 7-10 
days later. Successive MPTP injections are administered on days 1, 2 and 3. Animals' 
receive application of 4 mg/kg MPTP hydrochloride (Sigma) in saline once daily. All 
30 injections are intraperitoneal (i.p.) and the MPTP stock solution is frozen between 
injections. Animals are decapitated on day 11. 
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Irnmunohistology. At the completion of behavioral experiments, all animals are 
anaesthetized with 3 ml thiopental (1 g/40 ml i.p., Tyrol Phanna). The mice are 
perfused transcardially with 0.01 M PBS (pH 7.4) for 2 min, followed by 4% 
5 paraformaldehyde (Merck) in PBS for 15 min. The brains are removed and placed in 
4% paraformaldehyde for 24 h at 4 °C. For dehydration they are then transferred to a 
20% sucrose (Merck) solution in 0.1 M PBS at 4 °C until they sink. The brains are 
frozen in methylbutan at -20 °C for 2 min and stored at -70 °C. Using a sledge 
microtome (mod. 3800-Frigocut, Leica), 25 jam sections are taken from the genu of 
10 the corpus callosum (AP 1.7 mm) to the hippocampus (AP 21.8 mm) and from AP 
24.16 to AP 26.72. Forty-six sections are cut and stored in assorters in 0.25 M Tris 
buffer (pH 7.4) for immunohistochemistry. 

A series of sections is processed for free-floating tyrosine hydroxylase (TH) im- 
15 munohistochemistry. Following three rinses in 0.1 M PBS, endogenous peroxidase 
activity is quenched for 10 min in 0.3% H2O2 ±PBS. After rinsing in PBS, sections 
are preincubated in 10% normal bovine serum (Sigma) for 5 min as blocking agent 
and transferred to either primary anti-rat TH rabbit antiserum (dilution 1 :2000). 

20 Following overnight incubation at room temperature, sections for TH immuno-. 
reactivity are rinsed in PBS (2 xlO min) and incubated in biotinylated anti-rabbit 
immunoglobulin G raised in goat (dilution 1:200) (Vector) for 90 min, rinsed 
repeatedly and transferred to Vectastain ABC (Vector) solution for 1 h. 3 V 3' 
-Diaminobenzidine tetrahydrochloride (DAB; Sigma) in 0.1 M PBS, supplemented 

25 with 0.005% H2O2 , serves as chromogen in the subsequent visualization reaction. 
Sections are mounted on to gelatin-coated slides, left to dry overnight, 
counter-stained with hematoxylin dehydrated in ascending alcohol concentrations 
and cleared in butylacetate. Coverslips are mounted on entellan. 

30 Rotarod Test We use a modification of the procedure described by Rozas and 
Labandeira-Garcia (1997), with a CR-1 Rotamex system (Columbus Instruments, 
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Columbus, OH) comprising an IBM-compatible personal computer, a CIO-24 data 
acquisition card, a control unit, and a four-lane rotarod.unit. The rotarod unit consists 
of a rotating spindle (diameter 7.3 cm) and individual compartments for each mouse. 
The system software allows preprogramming of session protocols with varying 
5 rotational speeds (0-80 rpm). Infrared beams are used to detect when a mouse has 
fallen onto the base grid beneath the rotarod. The system logs the fall as the end of 
the experiment for that mouse, and the total time on the rotarod, as well as the time 
of the fall and all the set-up parameters, are recorded. The system also allows a weak 
current to be passed through the base grid, to aid training. 

10 

Dementia 

The object recognition task. The object recognition task has been designed to assess 
the effects of experimental manipulations on the cognitive performance of rodents. A 

15 rat is placed in an open field, in which two identical objects are present. The rats 
inspects both objects during the first trial of the object recognition task. In a second 
trial, after a retention interval of for example 24 hours, one of the two objects used in 
the first trial, the 'familiar' object, and a novel object are placed in the open field. 
The inspection time at each of the objects is registered. The basic measures in the OR 

20 task is the time spent by a rat exploring the two object the second trial. Good 
retention is reflected by higher exploration times towards the novel than the 
'familiar' object. 

Administration of the putative cognition enhancer prior to the first trial pre- 
25 dominantly allows assessment, of the effects on acquisition, and eventually on 
consolidation processes. Administration of the testing compound after the first trial 
allows to assess the effects on consolidation processes, whereas administration before 
the second trial allows to measure effects on retrieval processes. 

30 The passive avoidance task. The passive avoidance task assesses memory 
performance in rats and mice. The inhibitory avoidance apparatus consists of a 
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two-compartment box with a light compartment and a dark compartment. The two 
compartments are separated by a guillotine door that can be operated by the 
experimenter. A threshold of 2 cm (separates the two compartments when the 
guillotine door is raised. When the door is open, the illumination in the dark 
5 compartment is about 2 lux. The light intensity is about 500 lux at the center of the 
floor of the light compartment. 

Two habituation sessions, one shock session, and a retention session are given, 
separated by inter-session intervals of 24 hours. In the habituation sessions and the 
10 retention session the rat is allowed to explore the apparatus for 300 sec. The rat is 
placed in the light compartment, facing the wall opposite to the guillotine door. After 
an accommodation period of 15 sec. the guillotine door. is opened so that all parts of 
the apparatus can be visited freely. Rats normally avoid brightly lit areas and will 
enter the dark compartment within a few seconds. 

15 

In the shock session the guillotine door between the compartments is lowered as soon 
as the rat has entered 'the dark compartment with its four paws, and a scrambled 1 
mA footshock is administered for 2 sec. The rat is removed from the apparatus and 
put back into its home cage. The procedure during the retention session is identical to 
20 that of the habituation sessions. 

The. step-through latency, that is the first latency of entering the dark compartment 
(in sec.) during the retention session is an index of the memory performance of the 
animal; the longer the latency to enter the dark compartment, the better the retention 
25 is. A testing compound in given half an hour before the shock session, together with 
1 mg*kg _1 scopolamine. Scopolamine impairs the memory performance during the 
retention session 24 hours later. If the test compound increases the enter latency 
compared with the scopolamine-treated controls, is likely to possess cognition 
enhancing potential. 



30 
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Tlte Morris water escape task. The Morris water escape task measures spatial 
orientation learning in rodents. It is a test system that has extensively been used to 
investigate the effects of putative therapeutic on the cognitive functions of rats and 
mice. The performance of an animal is assessed in a circular water tank with an 
5 escape platform that is submerged about 1 cm below the surface of the water. The 
escape platform is not visible for an animal swimming in the water tank. Abundant 
extra-maze cues are provided by the furniture in the room, including desks, computer 
equipment, a second water tank, the presence of Jhe experimenter, and by a radio on a 
shelf that is playing softly. 

10 

The animals receive four trials during five daily acquisition sessions. A trial is started 
by placing an animal into the pool, facing the wall of the tank. Each of four starting 
positions in the quadrants north, east, south, and west is used once in a series of four 
trials; their order is randomized. The escape platform is always in the same position. 

15 A trial is terminated as soon as the animal had climbs onto the escape platform or 
when 90 seconds have elapsed, whichever event occurs first. The animal is allowed 
to stay on the platform for 30 seconds. Then it is taken from the platform and the 
next trial is started. If an animal did not find the platform within 90 seconds it is put 
on the platform by the experimenter and is allowed to stay there for 30 seconds. After 

20 the fourth trial of the fifth daily session, an additional trial is given as a probe trial: 
the platform is removed, and the time the animal spends in the four quadrants is 
measured for 30 or 60 seconds. In the probe trial, all animals start from the same start 
position, opposite to the quadrant where the escape platform had been positioned 
during acquisition. 

25 

Four different measures are taken to evaluate the performance of an animal during 
acquisition training: escape latency, traveled distance, distance to platform, and 
swimming speed. The following measures are evaluated for the probe trial: time (s)' 
in quadrants and traveled distance (cm) in the four quadrants. The probe trial 
30 provides additional information about how well an animal learned the position of the 
escape platform. If an animal spends more time and swims a longer distance in the 
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quadrant where the platform had been positioned during the acquisition sessions than 
in any other quadrant, one concludes that the platform position has been learned well. 

In order to assess the effects of putative cognition enhancing compounds, rats or 
5 mice with specific brain lesions which impair cognitive functions, or animals treated 
with compounds such as scopolamine or MK-801, which interfere with normal 
learning, or aged animals which suffer from cognitive deficits, are used. 

The T-maze spontaneous alternation task. The T-maze spontaneous alternation task 
10 (TeMCAT) assesses the spatial memory performance in mice. The start arm and the 
two goal arms of the T-maze are provided with guillotine doors which can be 
operated manually by the experimenter. A mouse is put into the start arm at the 
beginning of training. The guillotine door is closed. In the first trial, the 'forced trial 5 , 
either the left or right goal arm is blocked by lowering the guillotine door. After the 
15 mouse has been released from the start arm, it will negotiate the maze, eventually 
enter the open goal arm, and return to the start position, where it will be confined for 
5 seconds, by lowering the guillotine door. Then, the animal can choose freely 
between the left and right goal arm (all guillotine-doors opened) during 14 'free 
choice 5 trials. As soon a the mouse has entered one goal arm, the other' one is closed. 
20 The mouse eventually returns to the start arm and is free to visit whichever go alarm 
it wants after having been confined to the start arm for 5 seconds. After completion 
of 14 free choice trials in one session, the animal is removed from the maze. During 
training, the animal is never handled. 

25 The percent alternations out of 14 trials is calculated. This percentage and the total 
time needed to complete the first forced trial and the subsequent 14 free choice trials 
(in s) is analyzed. Cognitive deficits are usually induced by an injection of 
scopolamine, 30 min before the start of the training session. Scopolamine reduced the' 
per-cent alternations to chance level, or below. A cognition enhancer, which is 

30 always administered before the training session, will at least partially, antagonize the 
scopolamine-induced reduction in the spontaneous alternation rate. 
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EXAMPLE 9 

Identification of test compound efficacy in a COPD animal model 

5 

Guinea pigs are exposed on a single occasion to tobacco smoke for 50 minutes. 
Animals are sacrificed between 10 minutes and 24 hour following the end of the 
exposure and their lungs placed in RNAlater™. The lung tissue is homogenised ,and 
total RNA was extracted using a Qiagens RNeasy™ Maxi kit. Molecular Probes 
10 RiboGreen™ RNA quantitation method is used to quantify the amount of RNA in 
each sample. 

Total RNA is reverse transcribed, and the resultant cDNA is used in a real-time poly- 
merase chain reaction (PCR). The cDNA is added to a solution containing the sense 

1 5 and anti-sense primers and the 6-carboxy-tetramethyl-rhodamine labeled probe of the 
receptor tyrosine phosphatase gene. Cyclophilin is used as the housekeeping gene. 
The expression of the receptor tyrosine phosphatase gene is measured using the 
TaqMan real-time PCR system that generates an amplification curve for each sample. 
From this curve a threshold cycle value is calculated: the fractional cycle number at 

20 which the amount of amplified target reaches a fixed threshold. A sample containing 
many copies of the receptor tyrosine phosphatase gene will reach this threshold 
earlier than a sample containing fewer copies. The threshold is set at 0.2, and the 
threshold cycle Cj is calculated from the amplification curve. The Cj value for the 
receptor tyrosine phosphatase gene is normalized using the Cj. value for the house- 

25 keeping gene. 

Expression of the receptor tyrosine phosphatase gene is increased by at least 3-fold 
between 10 minutes and 3 hours post tobacco smoke exposure compared to air 
exposed control animals. 



30 
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Test compounds are evaluated as follows. Animals are pre-treated with a test com- 
pound between 5 minutes and 1 hour prior to the tobacco smoke exposure and they 
are then sacrificed up to 3 hours after the tobacco smoke exposure has been 
completed. Control animals are pre-treated with the vehicle of the test compound via 
5 the route of administration chosen for the test compound. A test compound that 
reduces the tobacco smoke induced upregulation of receptor tyrosine phosphatase 
gene relative to the expression seen in vehicle treated tobacco smoke exposed 
animals is identified as an inhibitor of receptor tyrosine phosphatase gene expression. 
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CLAIMS 

1 . An isolated polynucleotide being selected from the group consisting of: 

a. a polynucleotide encoding a receptor tyrosine phosphatase polypeptide 
5 comprising an amino acid sequence selected form the group 

consisting of: 

i. amino acid sequences which are at least about 87% identical to 
the amino acid sequence shown in SEQ ID NO: 2; and 

ii. the amino acid sequence shown in SEQ ID NO: 2. 
10 b. a polynucleotide comprising the sequence of SEQ ID NO: 1 ; 

c. a polynucleotide which hybridizes under stringent conditions to a 

polynucleotide specified in (a) and (b) and encodes a receptor tyrosine 

phosphatase polypeptide; 
cL a polynucleotide the sequence of which deviates from the 
15 polynucleotide sequences specified in (a) to, (c) due to the 

degeneration of the genetic code and encodes a receptor tyrosine 

phosphatase polypeptide; and 
e. a polynucleotide which represents a fragment, derivative or allelic 

variation of a polynucleotide sequence specified in (a) to (d) and 
20 encodes a receptor tyrosine phosphatase polypeptide. 

2. An expression vector containing any polynucleotide of claim 1 . 

3 . A host cell containing the expression vector of claim 2. 



25 



4. A substantially purified receptor tyrosine phosphatase polypeptide encoded 
by a polynucleotide of claim 1. 



5. 

30 



A method for producing a receptor tyrosine phosphatase polypeptide, wherein 
the method comprises the following steps: 
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a. culturing the host cell of claim 3 under conditions suitable for the 
expression of the receptor tyrosine phosphatase polypeptide; and 

b. recovering the receptor tyrosine phosphatase polypeptide from the 
host cell culture. 

A method for detection of a polynucleotide encoding a receptor tyrosine 
phosphatase polypeptide in a biological sample comprising the following 



a. hybridizing any polynucleotide of claim 1 to a nucleic acid material of 
10 a biological sample, thereby forming a hybridization complex; and 

b. detecting said hybridization complex. 

7. The method of claim 6, wherein before hybridization, the nucleic acid 
material of the biological sample is amplified. 

15 

8. A method for the detection of a polynucleotide of claim 1 or a receptor 
tyrosine phosphatase polypeptide of claim 4 comprising the steps of: 

a. contacting a biological sample with a reagent which specifically 
interacts with the polynucleotide or the receptor tyrosine phosphatase 

20 polypeptide and 

b. detecting the interaction 

9. A diagnostic kit for conducting the method of any one of claims 6 to 8. 

25 10. A method of screening for agents which decrease the activity of a receptor 
tyrosine phosphatase, comprising the steps of: 

a. contacting a test compound with any receptor tyrosine phosphatase 
polypeptide encoded by any polynucleotide of claiml ; 

b. detecting binding of the test compound to the receptor tyrosine 
30 phosphatase polypeptide, wherein a test compound which binds to the 
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polypeptide is identified as a potential therapeutic agent for decreasing 
the activity of a receptor tyrosine phosphatase. 



11. A method of screening for agents which regulate the activity of a receptor 
5 tyrosine phosphatase, comprising the steps of: 

a. contacting a test compound with a receptor tyrosine phosphatase 
polypeptide encoded by any polynucleotide of claim 1; and 

b. detecting a receptor tyrosine phosphatase activity of the polypeptide, 
wherein a test compound which increases the receptor tyrosine 

10 phosphatase activity is identified as a potential therapeutic agent for 

increasing the activity of the receptor tyrosine phosphatase, and 
wherein a test compound which decreases the receptor tyrosine phos- 
phatase activity of the polypeptide is identified as a potential thera- 
peutic agent for decreasing the activity of the receptor tyrosine phos- 

15 phatase. 

12. A method of screening for agents which decrease the activity of a receptor 
tyrosine phosphatase, comprising the step of: 



20 contacting a test compound with any polynucleotide of claim 1 and detecting 

binding of the test compound to the polynucleotide, wherein a test compound 
which binds to the polynucleotide is identified as a potential therapeutic agent 
for decreasing the activity of receptor tyrosine phosphatase. 

25 13. A method of reducing the activity of receptor tyrosine phosphatase, 
comprising the step of: 



30 



contacting a cell with a reagent which specifically binds to any' 
polynucleotide of claim 1 or any receptor tyrosine phosphatase polypeptide of 
claim 4, whereby the activity of receptor tyrosine phosphatase is reduced. 
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14. A reagent that modulates the activity of a receptor tyrosine phosphatase 
polypeptide or a polynucleotide wherein said reagent is identified by the 
method of any of the claim 10 to 12. 

15. A pharmaceutical composition, comprising: 

the expression vector of claim 2 or the reagent of claim 14 and a 
pharmaceutically acceptable carrier. 

16. Use of the expression vector of claim 2 or the reagent of claim 14 in the 
preparation of a medicament for modulating the activity of a receptor tyrosine 
phosphatase in a disease. 

17. Use of claim 16 wherein the disease is diabetes, a CNS disorder, obesity, 
COPD or a cardiovascular disorder. 
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Fig. 1 

atggattttc ttatcatttt tcttttactt tttattggga cttcagagac acaggttgat 
gtttccaatg tcgttcctgg tactaggtac gatataacca tctcttcaat ttctacaaca 
tacacctcac ctgttactag aatagtgaca acaaatgtaa caaaaccagg gcctccagtc 
ttcctagccg gggaaagagt cggatctgct gggattcttc tgtcttggaa tacaccacct 
aatccaaatg gaaggattat atcttacatt gtcaaatata aggaagtttg tccgtggatg 
caaacagtat atacacaagt cagatcaaag ccagacagtc tggaagttct tcttactaat 
cttaatcctg gaacaacata tgaaattaag gttgctgctg aaaacagtgc tggcattgga 
gtgtttagtg atccatttct cttccaaact gcagaaagtg ctccaggaaa agtggtgaat 
ctcacagttg aggcctacaa cgcttcagca gttaagctga tttggtattt acctcggcaa 
ccaaatggca aaattaccag cttcaagatt agtgtcaagc atgccagaag tgggatagta 
gtgaaagatg tctcaatcag agtagaggac attttgactg ggaaattgcc agaaagcaat 
gagaatagtg aatctttttt atggagtaca gccagccctt ctccaaccct tggtagagtt 
acacctccat cgcgtaccac acattcatca agcacgttga cacagaatga gatcagctct 
gtgtggaaag agcctatcag ttttgtagtg acacacttga gaccttatac aacatatctt 
tttgaagttt cagctgctac aactgaagca ggttatattg atagtacgat tgtcagaaca 
ccagaatcag tgcctgaagg accaccacaa aactgcgtaa caggcaacat cacaggaaag 
tccttttcaa ttttatggga cccaccaact atagtaacag ggaaatttag ttatagagtt 
gaattatatg gaccatcagg tcgcattttg gataacagca caaaagacct caagtttgca 
ttcactaacc taacaccatt tacaatgtat gatgtctata ttgcggctga aaccagtgca 
gggactgggc ccaagtcaaa tatttcagta ttcactccac cagatgttcc aggggcagtg 
tttgatttac aacttgcaga ggtagaatcc acgcaagtaa gaattacttg gaagaaacca 
cgacaaccaa atggaattat taaccaatac cgagtgaaag tgctagttcc agagacagga 
ataattttgg aaaatacttt gctcactgga aataatgagt atataaatga ccccatggct 
ccagaaattg tgaacatagt agagccaatg gtaggattat atgagggttc agcagagatg 
tcgtctgacc ttcactcact tgctacattt atatataaca gccatccaga taaaaacttt 
cctgcaagga atagagctga agaccagact tcaccagttg taactacaag gaatcagtat 
attactgaca ttgcagctga acagctgtct tatgttatca ggagacttgt acctttcact 
gagcacatga ttagtgtatc tgctttcacc atcatgggag aaggaccacc aacagttctc 
agtgttagga cacgtcagca agtgccaagc tccattaaaa ttataaacta taaaaatatt 
agttcttcat ctattttgtt atattgggat cctccagaat atcccaatgg aaaaataact 
cactatacga tttatgcaat ggaattggat acaaacagag cattccagat aactaccata 
gataacagct ttctcataac agggttaaag aaatacacaa aatacaaaat gagagtggca 
gcctcaaccc acgttggaga aagttctttg tctgaagaaa atgacatctt tgtgagaact 
tcagaagatg aaccggaatc atcacctcaa gatgtcgaag taattgatgt taccgcagat 
gaaataaggt tgaagtggtc accacccgaa aagcccaatg ggatcattat tgcttatgaa 
gtgctatata aaaatataga tactttatat atgaagaaca catcaacaac agacataata 
ttaaggaact taagacctca caccctctat aacatttctg taaggtctta caccagattt 
ggtcatggca atcaggtatc ttctttactc tctgtaagga cttcggagac tgtgcctgat 
agtgcaccag aaaatatcac ttacaaaaat atttcttctg gagagattga gctatcattc 
cttcccccaa gtagtcccaa tggaatcata caaaaatata caatttatct caagagaagt 
aatggaaatg aggaaagaac tataaataca acctctttaa cccaaaacat taaagtactg 
aagaaatata cccaatatat cattgaggtg tctgctagta cactcaaagg tgaaggagtt 
■cggagtgctc ccataagtat actgacggag gaagatgctc ctgattctcc ccctcaagac 
ttctctgtaa aacagttgtc tggtgtcacg gtgaagttgt catggcaacc acccctggag 
ccaaatggaa ttatccttta ttacacagtt tatgtctgga atagatcatc attaaaaact 
attaatgtca ctgaaacatc attggagtta tcagatttgg attataatgt tgaatacagt 
gcttatgtaa cagctagcac cagatttggt gatgggaaaa caagaagcaa tatcattagc 
tttcaaacac cagagggagc accaagcgat cctcccaaag atgtttatta tgcaaacctc 
agttcttcat caataattct tttctggaca cctccttcaa aacctaatgg gattatacaa 
tattactctg tttattacag aaatacttca ggtactttta tgcagaattt tacactccat 
gaagtaacca atgactttga caatatgact gtatccacaa ttatagataa actgacaata 
ttcagctact atacattttg gttaacagca agtacttcag ttggaaatgg gaataaaagc 
agtgacatca ttgaagtata cacagatcaa gacatacctg aagggtttgt tggaaacctg 
acttacgaat ccatttcgtc aactgcaata aatgtaagct gggtcccacc ggctcaacca 
aacggtctag tcttctacta tgtttcactg atcttacagc agactcctcg ccatgtgaga 
ccacctcttg ttacatatga gagaagcata tattttgata atctggaaaa atacactgat 
tatatattaa aaattactcc atcaacagaa aagggattct ctgataccta tactgcccag 
ctatacatca agactgaaga agatgtccca gaaacttcac caataatcaa cacttttaaa 
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Pig. 1 (continued) 

aacctttcct ctacctcagt tctcttatca tgggatcccc cagtaaagcc aaatggtgca 
ataataagtt atgatttaac tttacaagga ccaaatgaaa attattcttt cattacttct 
gataattaca taatattgga agagctttca ccatttacat tatatagctt ttttgctgcc 
gcaagaacta gaaaaggact tggtccttcc agtattcttt tcttttacac agatgagtca 
gtgccgttag cacctccaca aaatttgact ttaatcaact gtacttcaga ctttgtatgg 
ctgaaatgga gcccaagtcc tcttccaggt ggtattgtta aagtatatag ttttaaaatt 
catgaacatg aaactgacac tatatattat aagaatatat caggatttaa aactgaagcc 
aaacttgttg gactggaacc agtcagcacc tactctatcc gtgtatctgc gttcaccaaa 
gttggaaatg gcaatcaatt tagtaatgta gtaaaattca caacccaaga atcagttcca 
gatgtcgtgc agaatatgca gtgcatggca actagctggc agtcagtttt agtgaaatgg 
gatccaccca aaaaggcaaa tggaataata acgcagtata tggtaacagt tgaaaggaat 
tctacaaaag tttctcccca agatcacatg tacactttca taaagcttct tgccaatacc 
tcatatgtct ttaaagtaag agcttcaacc tcagctggtg aaggtgatga aagcacatgc 
catgtcagca cactacctga aacagttccc agtgttccca caaatattgc tttttctgat 
gttcagtcaa ctagtgcaac attgacatgg ataagacctg acactatcct tggctacttt 
caaaattaca aaattaccac tcaacttcgt gctcaaaaat gcaaagaatg ggaatccgaa 
gaatgtgttg aatatcaaaa aattcaatac ctctatgaag ctcacttaac .tgaagagaca 
gtatatggat taaagaaatt tagatggtat agattccaag ' tggctgccag caccaatgct 
ggctatggca atgcttcaaa ctggatttct acaaaaactc tgcctggccc tccagatggt 
cctcctgaaa atgttcatgt agtagcaaca tcacctttta gcatcagcat aagctggagt 
gaacctgctg tcattactgg accaacatgt tatctgattg atgtcaaatc ggtagataat 
gatgaattta atatatcctt catcaagtca aatgaagaaa ataaaaccat agaaattaaa 
gatttagaaa tattcacaag gtattctgta gtgatcactg catttactgg gaacattagt 
gctgcatatg tagaagggaa gtcaagtgct gaaatgattg ttactacttt agaatcagcc 
ccaaaggacc cacctaacaa catgacattt cagaagatac cagatgaagt tacaaaattt 
caattaacgt tccttcctcc ttctcaacct aatggaaata tccaagtata tcaagctctg 
gtttaccgag aagatgatcc tactgctgtc cagattcaca acctcagtat tatacagaaa 
accaacacat tcgtcattgc aatgctagaa ggactaaaag gtggacatac atacaatatc 
agtgtttacg cagtcaatag tgctggtgca ggtccaaagg ttccgatgag aataaccatg 
gatatcaaag ctccagcacg accaaaaacc aaaccaaccc ctatttatga tgccacagga 
aaactgcttg tgacttcaac aacaattaca atcagaatgc caatatgtta ctacagtgat 
gatcatggac caataaaaaa tgtacaagtg cttgtgacag aaacaggagc tcagcatgat 
ggaaatgtaa caaagtggta tgatgcatat tttaataaag caaggccata ttttacaaat 
gaaggctttc ctaaccctcc atgtacagaa ggaaagacaa agtttagtgg caatgaagaa 
atctacatca taggtgctga taatgcatgc atgattcctg gcaatgaaga caaaatttgc 
aatggaccac tgaaaccaaa aaagcaatac ttatttaaat ttagagctac aaatattatg 
ggacaattta ctgactctga ttattctgac cctgttaaga ctttagggga aggactttca 
gaaagaaccg tagagatcat tctttccgtc actttgtgta tcctttcaat aattctcctt 
ggaacagcta tttttgcatt tgcaagaatt cgacagaagc agaaagaagg tggcacatac 
tctcctcagg atgcagaaat tattgacact aaattgaagc tggatcagct catcacagtg 
gcagacctgg aactgaagga cgagagatta acgcggttac ttagttatag aaaatccatc 
aagccaataa gcaagaaatc cttcctgcaa catgttgaag agctttgcac aaacaacaac 
ctaaagtttc aagaagaatt ttcggaatta ccaaaatttc ttcaggatct ttcttcaact 
gatgctgatc tgccttggaa tagagcaaaa aaccgcttcc caaacataaa accatataat 
aataacagag taaagctgat agctgacgct agtgttccag gttcggatta tattaatgcc 
agctatattt ctggttattt atgtccaaat gaatttattg ctactcaagg tccactacca 
ggaacagttg gagatttttg gaaaatggtg tgggaaacca gagcaaaaac attagtaatg 
ctaacacagt gttttgaaaa aggacggatc agatgccatc agtattggcc agaggacaac 
aagccagtta ctgtctttgg agatatagtg attacaaagc taatggagga tgttcaaata 
gattggacta tcagggatct gaaaattgaa aggcatgggg attgcatgac tgttcgacag 
tgtaacttta ctgcctggcc agagcatggg gttcctgaga acagcgcccc tctaattcac 
tttgtgaagt tggttcgagc aagcagggca catgacacca cacctatgat tgttcactgc 
agtgctggag ttggaagaac tggagttttt attgctctgg accatttaac acaacatata 
aatgaccatg attttgtgga tatatatgga ctagtagctg aactgagaag tgaaagaatg 
tgcatggtgc agaatctggc acagtatatc tttttacacc agtgcattct ggatctctta 
tcaaataagg gaagtaatca gcccatctgt tttgttaact attcagcact tcagaagatg 
gactctttgg acgccatgga aggtgatgtt gagcttgaat gggaagaaac cactatg 
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Fig. 2 

MDFLIIFLLL FIGTSETQVD VSNWPGTRY DITISSISTT YTSPVTRIVT TNVTKPGPPV 
FLAGERVGSA GILLSWNTPP NPNGRIISYI VKYKEVCPWM QTVYTQVRSK PDSLEVLLTN 
LNPGTTYEIK VAAENSAGIG VFSDPFLFQT AESAPGKWN LTVEAYNASA VKLIWYLPRQ 
PNGKITSFKI SVKHARSGIV VKDVSIRVED ILTGKLPESN ENSESFLWST ASPSPTLGRV 
TPPSRTTHSS STLTQNEISS VWKEPISFW THLRPYTTYL FEVSAATTEA GYIDSTIVRT 
PESVPEGPPQ NCVTGNITGK SFSILWDPPT IVTGKFSYRV ELYGPSGRIL DNSTKDLKFA 
FTNLTPFTMY DVYIAAETSA GTGPKSNISV FTPPDVPGAV FDLQLAEVES TQVRITWKKP 
RQPNGIINQY RVKVLVPETG IILENTLLTG NNEYINDPMA . PEIVNIVEPM VGLYEGSAEM 
SSDLHSLATF IYNSHPDKNF PARNRAEDQT SPWTTRNQY ITDIAAEQLS YVIRRLVPFT 
EHMISVSAFT IMGEGPPTVL SVRTRQQVPS SIKIINYKNI SSSSILLYWD PPEYPNGKIT 
HYTIYAMELD TNRAFQITTI DNSFLITGLK KYTKYKMRVA ASTHVGESSL SEENDIFVRT 
SEDEPESSPQ DVEVIDVTAD EIRLKWSPPE KPNGIIIAYE VLYKNIDTLY MKNTSTTDII 
LRNLRPHTLY NISVRSYTRF GHGNQVSSLL SVRTSETVPD SAPEtflTYKN ISSGEIELSF 
LPPSSPNGII QKYTIYLKRS NGNEERTINT TSLTQNIKVL KKYTQYIIEV SASTLKGEGV 
RSAPISILTE EDAPDSPPQD FSVKQLSGVT VKLSWQPPLE PNGIILYYTV YVWNRSSLKT 
INVTETSLEL SDLDYNVEYS AYVTASTRFG DGKTRSNIIS FQTPEGAPSD PPKDVYYANL 
SSSSIILFWT PPSKPNGIIQ YYSVYYRNTS GTFMQNFTLH EVTNDFDNMT VSTIIDKLTI 
FSYYTFWLTA STSVGNGNKS SDIIEVYTDQ DIPEGFVGNL TYESISSTAI NVSWVPPAQP 
NGLVFYYVSL ILQQTPRHVR PPLVTYERSI YFDNLEKYTD YILKITPSTE KGFSDTYTAQ 
LYIKTEEDVP ETSPIINTFK NLSSTSVLLS WDPPVKPNGA IISYDLTLQG PNENYSFITS 
DNYIILEELS PFTLYSFFAA ARTRKGLGPS SILFFYTDES VPLAPPQNLT LINCTSDFVW 
LKWSPSPLPG GIVKVYSFKI HEHETDTIYY KNISGFKTEA KLVGLEPVST YSIRVSAFTK 
VGNGNQFSNV VKFTTQESVP DWQNMQCMA TSWQSVLVKW DPPKKANGII TQYMVTVERN 
STKVSPQDHM YTFIKLIiANT SYVFKVRAST SAGEGDESTC HVSTLPETVP SVPTNIAFSD 
VQSTSATLTW IRPDTILGYF QNYKITTQLR AQKCKEWESE ECVEYQKIQY LYEAHLTEET 
VYGLKKFRWY RFQVAASTNA GYGNASNWIS TKTLPGPPDG PPENVHWAT SPFSISISWS 
EPAVITGPTC YLIDVKSVDN DEFNISFIKS NEENKTIEIK DLEIFTRYSV VITAFTGNIS 
AAYVEGKSSA EMIVTTLESA PKDPPNNMTF QKIPDEVTKF QLTFLPPSQP NGNIQVYQAL 
VYREDD PTAV QIHNLSIIQK TNTFVIAMLE GLKGGHTYNI SVYAVNSAGA GPKVPMRITM 
DIKAPARPKT KPTPIYDATG KLLVTSTTIT IRMPICYYSD DHGPIKNVQV LVTETGAQHD 
GNVTKWYDAY FNKARPYFTN EGFPNPPCTE GKTKFSGNEE IYIIGADNAC MIPGNEDKIC 
NGPLKPKKQY LFKFRATNIM GQFTDSDYSD PVKTLGEGLS ERTVEIILSV TLCILSIILL 
GTAIFAFARI RQKQKEGGTY SPQDAEIIDT KLKLDQLITV ADLELKDERL TRLLSYRKSI 
KPISKKSFLQ HVEELCTNNN LKFQEEFSEL PKFLQDLSST DADLPWNRAK NRFPNIKPYN 
NNRVKLIADA SVPGSDYINA SYISGYLCPN EFIATQGPLP GTVGDFWKMV WETRAKTLVM 
LTQCFEKGRI RCHQYWPEDN KPVTVFGDIV ITKLMEDVQI DWTIRDLKIE RHGDCMTVRQ 
CNFTAWPEHG VPENSAPLIH FVKLVRASRA HDTTPMIVHC SAGVGRTGVF IALDHLTQHI - 
NDHDFVDIYG LVAELRSERM CMVQNLAQYI FLHQCILDLL SNKGSNQPIC FVNYSALQKM 
DSLDAMEGDV ELEWEETTM 
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Fig. 3 

MMDFHFSFIiFIiLIGTSESQVDVSSSPDGTGYDITLSSVSATTYSS 
PVSRTLATNVTKPGPPVFLAGERVGSAGILLSWNTPPNPNGRIISYVVKYKEVCPWMQT 
AYTRARAKPDSLEVLLTNLNPGTTYEIKVAAEOTAGIGV^ 
TVEALNYSAVNLIWYLPRQPNGKITSFKISVKHARSG 

ENSESFLWSTTSPSPTLGRVTPTVRTTQSSSTAARSKISSWKEPISFVVTHLRPYTTY 

LFEVSAVTTEAGYIDSTIVRTPESVPEGPPQNCIMGNVTGKAFSISWDPPTIVTGKFSY 

RVELYGPSGRILDNSTKDLRFAFTHLTPFTMYDVYVAAETSAGVGPKSNLSVFTPPDVP 

GAVFDLQIAEVEATEIRITWRKPRQPNGI ISQYRVKVSVLETGVVLENTLLTGQDES IS 

NPMSPEIMNLVDPMIGFYEGSGEMSSDIjHSPASFIYNSHPHNDFPASTRAEEQSSPVVT 

TRNQYMTDITAEQLSYVVRRLVPFTEHTISVSAFTIMGEGPPTVLTVRTREQVPSSIQI 

INYKHISSSSILLYWDPPEYPNGKITHYTIYATELDTNRAFQMTTVDNSFLITGLKKYT 

RYKMRVAASTHVGESSLSEE1TOIFVRTPEDEPESSPQDVQVTGVSPSELRLKWSPPEKP 

NGI 1 1 AYEVLYQNADTLF VKNTS TTD III SDLKPYTLYNI S IRS YTRLGHGNQSS SLLS 

VRTSETVPDSAPENITYKNISSGEIEISFLPPRSPNGIIQKYTIYLKRSNSHEARTINT 

TSLTQTIGGLKKYTHYVIEVSASTLKGEGIRSRPISILTEEDAPDSPPQNFSVKQLSGV 

TVMLSWQPPLEPNGIILYYTVYVVnDKSSLRAINATEASLVLSDLDYNVDYGACVTASTR 

FGDGNARSSIINFRTPEGEPSDPPNDVHYVNLSSSSIILFWTPPVKPNGIIQYYSVYYQ 

OTSGTFVQNFTLLQVTKESDNVTVSARIYRLAIFSYYTFWLTASTSVGNGNKSSD 

YTDQDIPEGPVGNLTFESISSTAIHVSWEPPSQPNGLVFYYLSLNLQQSPPRHMIPPLV 

TYENSIDFDDLEKYTDYIFKITPSTEKGFSETYTTQLHIKTEEDVPDTPPIINTFKNLS 

STSILLSWDPPLKPNGAILGYHLTLQGPHANHTFVTSGNHIVLEELSPFTLYSFFAAAR 

TMKGLGPSSILFFYTDESAPL71PPQNLTLINYTSDFVWLTWSPSPLPGGIVKVYSFKIH 

EHETDTVFYKNISGLQTDAKLEGLEPVSTYSVSVSAFTKVGNGNQYSNVVEFTTQESVP 

EAVRNIECVARDWQSVSWWDPPRKTNGIIIHYMITVGGNSTKVSPRDPTYTFTKLLPN 

TS YVFEVRAS TS AGEGNESRCD I STLPETVPS APTNVAFSNVQS TS ATLTWTKPDT I FG 

YFQNYKI TTQLRAQKCREWE PEE C I EHQKDQYL YEANQTEETVHGLKKFRWYRFQVAAS 

TNVGYSNASEWISTQTLPGPPDGPPENVHWATSPFGINISWSEPAVITGPTFYLIDVK 

S VDDDDFNI S FLKSNEENKTTE INNLEVFTRYSWI TAF VGNVSRAYTDGKS SAEVI IT 

TLESVPKDPPNNMTFQKJPDEVTKFQLTFLPPSQPNGNIRWQALVYREDDPTAVQIHN 

FS 1 1 QKTDTS 1 1 AMLEGIjKGGHTYNI S VYAINSAGAGPKVQMRI TMD I KAPARPKS KPI 

PIRDATGKLLVTS TTI T IRMP I C YYNDDHGPIRNVQVLVAETGAQQDGNVTKWYDAYFN 

KARPYFTNEGFPNPPCIEGKTKFSGNEEIYVIGADNACMIPGNEEKICNGPLKPKKQYL 

FKFRATNVMGQFTDSEYSDPIKTLGEGLSERTVEIILSVTLCILSIILLGTAIFAFVRI 

RQKQKEGGTYS PRDAE I IDTKFKLD QL I TVADLELKDERLTRIjIjS YRKS I KP I S KKS FL 

QHVEELCTNSNLKFQEEFSELPKFLQDLSSTDADLPV^^ 

DVSLPGSDYINASYVSGYLCPmFIATQGPLPGTVGDFWRMWETRTKTLVMLTQCFEK 
GRIRCHQYWPEDNKPVTVFGDIVITKLMEDIQIDWTIRDLKIERHGDCMTVRQCNFTGW 
PEHGVPENTTPLIHFVKLTOTSRAHDTTPMVVHCSAGVGRTGVFIALDHLTQHINNHDF 
VDIYGLVAELRSERMCIWQNLAQYIFLHQCILDLLSNKGGHQPVCFVNYSTLQKMSLD 
AMEGDVELEWEETTM 
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Fig. 4 

MDFLIIFLLLFIGTSETQVDVSNVVPGTRYDITISSISTTYTSPVTRIVTTNVTKPGPPV 
FLAGERVGSAGIIiLSWNTPPNPNGRIISYIVKYKEVCPWMQTVYTQVRSKPDSLEVIiLTN 
LNPGTTYEI KVAAI2tf S AGI GVFSD PF 

PNGKITSFKISVKHARSGIVVKDVSIRVEDILTGKLPESNENSESFLWSTASPSPTLGRV 
TPPSRTTHSSSTLTQNEISSVWKEPISFVVTHLRPYTTYLFEVSAATTEAGYIDSTIVRT 
PESATAAGTQANRVWSRPLANSNRPAAEGP 



Fig. 5 

LLGGKLTNRKBIOTKNPSVHHHHQ 

FDLQLAEVESTQVRITWKKPRQPNGIINQYRVKVLVPETGIILENTLLTGNNEYINDPMA 
PEIWIVEPMVGLYEGSAEMSSDLHSLATFIYNSHPDKNFPARiniAEDQTSPWTTRNQY 
ITDIAAEQLS YVIRRLVPFTEHMI SVS AFTIMGEGPPTVLS VRTRQQVPS S I KI INYKNI 
S S S S ILLYWDPPE YPNGKI THYTI YAMELDTNRAFQ ITTIDNS FL ITGLKKYTKYKMRVA 
ASTHVGESSLSEENDIFVRTSEDEPESSPQDVEVIDVTADEIRLKWSPPEKPNGIIIAYE 
VLYKNIDTLYMKOTSTTDIILRJSTLRPHTLYNISWSYTRFGHGNQVSSLLSWTSETVTD 
S APENI TYKNI S S GE IELS FLPPS S PNGI IQKYT I YLKRSNGNEERTINTTS LTQNIKVL 
KKYTQYI IEVSASTLKGEGVRSAPIS ILTEEDAPDSPPQDFSVKQLSGVTVKLSWQPPLE 
PNGIILYYTVYVWNRSSLKTINVTETSLELSDLDYNVEYSAYVTASTRFGDGKTRSNIIS 

fqtpegapsdppkdvyyanlssssiilfwtppski>ngiiqyysvyyrntsgtfmqnftlh 

evtndfd™tvstiidkltifsyytfwltastsvgngnkssdiievytdqdipegfvgnl 

tyesisstainvswvppaqpnglvfyyvslilqqtprhvrpplvtyersiyfdnlekytd 

yilkitpstekgfsdtytaqlyikteedvpetspiintfknlsstsvllswdppvkpnga 

i isydltlqgpnenysfitsdnyi ileelspftlysffaaartrkglgpss ilffytdes 

vplappqnltlinctsdfvwlkwspsplpggivkvysfkihehetdtiyyknisgfktea 

klvglepvstysirvsaftkvgngnqfsnvvkfttqesvpdwqnmqcmatswqsvlvkw 

dppkkangiitqyiwtvernstkvspqdhmytfikllantsyvfkvrastsagegdestc 

hvstlpetvpsvptniafsdvqstsatltwirpdtilgyfqnykittqlraqkckewese 

ec\^yqkiqylyeahlteetvyglkkfrwyrfqvaastnagygnasnwistk 

ppenvhvvatspfsisiswsepavitgptcylidvksvdndefnisfiksneenktieik 

dleiftryswitaftgnisaayvegkssaemivttlesapkdppnnmtfqkipdevtkf 

qltflppsqpngniqvyqalvyreddptavqihnlsiiqktntfviamleglkgghtyni 

SVYAVWSAGAGPKVPmiTI^IKAPARPKTKPTPIYDATGKLLVTSTTITIRMPI 
DHGP I KtTVQVLVTETGAQHDGNVTKWYDAYFNKARP YFTNEGFPNP P CTEGKTKFS GNEE 
IYIIGADNACMIPGmDKICNGPLKPKKQYLFKFRATNIMGQFTDSDYSDPVKTLGEGLS 
ERTVEIILSVTLCILSIILLGTAIFAFARIRQKQKEGGTYSPQDAEIIDTKLKLDQLITV 
ADLELKDERLTRLLS YRKS IKPI S KKS FLQHVEELCTNNNLKFQEEFSELPKFLQDLS ST 
DADLPWNRAKNRFPNIKPYNIJNRVKIjIADASVPGSDYINASYISGYLCPNEFIATQGPLP 
GTVGDFWKMVWETRAKTLVMLTQ CFEKGRIRCHQYWPEDNKPVTVFGDI VI TKLMEDVQ I 
DWTIRDLKIERHGDCMTVRQCNFTAWPEHGVPENSAPLIHFVKLVRASRAHDTTPMIVHC 
SAGVGRTGVFI ALDHLTQHINDHD FVD I YGLVAELRSERMCMVQNLAQ YI FLHQC ILDLL 
SNKGSNQPICFVNYSALQKMDSLDAMEGDVELEWEETTM 
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Fig. 6 



ATGGATTTTCTTATCATTTTTCTTTTACTTTTTATTGGGACTTCAGAGACACAGGTTGAT 
GTTTCCAATGTCGTTCCTGGTACTAGGTACGATATAACCATCTCTTCT^ATTTCTACAACA 
TACACCTCACCTGTTACTAGAATAGTGACAACAAATGTAACAAAACCAGGGCCTCCAGTC 
TTCCTAGCCGGGGAAAGAGTCGGATCTGCTGGGATTCTTCTGTCTTGGAATACACCACCT 
AATCCAAATGGAAGGATTATATCTTACATTGTCAAATATAAGGAAGTTTGTCCGTGGATG 
CAAACAGTATATACACAAGTCAGATCAAAGCCAGACAGTCTGGAAGTTCTTCTTACTAAT 
CTTAATCCTGGAACAACATATGAAATTAAGGTTGCTGCTGAAAACAGTGCTGGCATTGGA 
GTGTTTAGTGATCCATTTCTCTTCCAAACTGCAGAAAGTGCTCCAGGAAAAGTGGTGAAT 
CTCACAGTTGAGGCCTACAACGCTTCAGCAGTTAAGCTGATTTGGTATTTACCTCGGCAA 
CCAAATGGCAAAATTACCAGCTTCAAGATTAGTGTCAAGCATGCCAGAAGTGGGATAGTA 
GTGAAAGATGTCTCAATCAGAGTAGAGGACATTTTGACTGGGAAATTGCCAGAAAGCAAT 
GAGAATAGTGAATCTTTTTTATGGAGTACAGCCAGCCCTTCTCCAACCCTTGGTAGAGTT 
ACACCTCCATCGCGTACCACACATTCATCAAGCACGTTGACACAGAATGAGATCAGCTCT 
GTGTGGAAAGAGCCTATCAGTTTTGTAGTGACACACTTGAGACCTTATACAACATATCTT 
TTTGAAGTTTCAGCTGCTACAACTGAAGCAGGTTATATTGATAGTACGATTGTCAGAACA 
CCAGAATCAGCCACCGCTGCTGGTACCCAGGCTAACAGGGTCTGGAGTAGACCTCTAGCA 
AACTCCAACAGACCTGCAGCGGAGGGTCCT 
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Fig. 7 

CTGTTAGGAGGAAAACTAACAAACAGAAAGGACATCCACACCAAAAACCCATCTGTACAT 
CACCATCATCAAAGACCAAAAGTAGATAAAACCACAAAGATGGGGAAAAAACAGAGCAGA 
AAAACTGGAAACTCTAAAAAGCAGAGCACCTCTCCTCCTCCAAAGGATGGCAGGGCAGTG 
TTTGATTTACAACTTGCAGAGGTAGAATCCACGCAAGTAAGAATTACTTGGAAGAAACCA 
CGACAACCAAATGGAATTATTAACCAATACCGAGTGAAAGTGCTAGTTCCAGAGACAGGA 
ATAATTTTGGAAAATACTTTGCTCACTGGAAATAATGAGTATATAAATGACCCCATGGCT 
CCAGAAATTGTGAACATAGTAGAGCCAATGGTAGGATTATATGAGGGTTCAGCAGAGATG 
TCGTCTGACCTTCACTCACTTGCTACATTTATATATAACAGCCATCCAGATAAAAACTTT 
CCTGCAAGGAATAGAGCTGAAGACCAGACTTCACCAGTTGTAACTACAAGGAATCAGTAT 
ATTACTGACATTGCAGCTGAACAGCTGTCTTATGTTATCAGGAGACTTGTACCTTTCACT 
GAGCACATGATTAGTGTATCTGCTTTCACCATCATGGGAGAAGGACCACCAACAGTTCTC 
AGTGTTAGGACACGTCAGCAAGTGCCAAGCTCCATTAAAATTATAAACTATAAAAATATT 
AGTTCTTCATCTATTTTGTTATATTGGGATCCTCCAGAATATCCCAATGGAAAAATAACT 
CACTATACGATTTATGCAATGGAATTGGATACAAACAGAGCATTCCAGATAACTACCATA 
GATAACAGCTTTCTCATAACAGGGTTAAAGAAATACACAAAATACAAAATGAGAGTGGCA 
GCCTCAACCCACGTTGGAGAAAGTTCTTTGTCTGAAGAAAATGACATCTTTGTGAGAACT 
TCAGAAGATGAACCGGAATCATCACCTCAAGATGTCGAAGTAATTGATGTTACCGCAGAT 
GAAATAAGGTTGAAGTGGTCACCACCCGAAAAGCCCAATGGGATCATTATTGCTTATGAA 
GTGCTATATAAAAATATAGATACTTTATATATGAAGAACACATCAACAACAGACATAATA 
TTAAGGAACTTAAGACCTCACACCCTCTATAACATTTCTGTAAGGTCTTACACCAGATTT 
GGTCATGGCAATCAGGTATCTTCTTTACTCTCTGTAAGGACTTCGGAGACTGTGCCTGAT 
AGTGCACCAGAAAATATCACTTACAAAAATATTTCTTCTGGAGAGATTGAGCTATCATTC 
CTTCCCCCAAGTAGTCCCAATGGAATCATACAAAAATATACAATTTATCTCAAGAGAAGT 
AATGGAAATGAGGAAAGAACTATAAATACAACCTCTTT^ 

AAGAAATATACCCAATATATCATTGAGGTGTCTGCTAGTACACTCAAAGGTGAAGGAGTT 

CGGAGTGCTCCCATAAGTATACTGACGGAGGAAGATGCTCCTGATTCTCCCCCTCAAGAC 

TTCTCTGTAAAACAGTTGTCTGGTGTCACGGTGAAGTTGTCATGGCAACCACCCCTGGAG 

CCAAATGGAATTATCCTTTATTACACAGTTTATGTCTGGAATAGATCATCATTAAAAACT 

ATTAATGTCACTGAAACATCATTGGAGTTATCAGATTTGGATTATAATGTTGAATACAGT 

GCTTATGTAACAGCTAGCACCAGATTTGGTGATGGGAAAACAAGAAGCAATATCATTAGC 

TTTCAAACACCAGAGGGAGCACCAAGCGATCCTCCCAAAGATGTTTATTATGCAAACCTC . 

AGTTCTTCATCAATAATTCTTTTCTGGACACCTCCTTCAAAACCTAATGGGATTATACAA 

TATTACTCTGTTTATTACAGAAATACTTCAGGTACTTTTATGCAGAATTTTACACTCCAT 

GAAGTAACCAATGACTTTGACAATATGACTGTATCCACAATTATAGATAAACTGACAATA 

TTCAGCTACTATACATTTTGGTTAACAGCAAGTACTTCAGTTGGAAATGGGAATAAAAGC 

AGTGACATCATTGAAGTATACACAGATCAAGACATACCTGAAGGGTTTGTTGGAAACCTG 

ACTTACGAATCCATTTCGTCAACTGCAATAAATGTAAGCTGGGTCCCACCGGCTCAACCA 

AACGGTCTAGTCTTCTACTATGTTTCACTGATCTTACAGCAGACTCCTCGCCATGTGAGA 

CCACCTCTTGTTACATATGAGAGAAGCATATATTTTGATAATCTGGAAAAATACACTGAT 

TATATATTAAAAATTACTCCATCAACAGAAAAGGGATTCTCTGATACCTATACTGCCCAG 

CTATACATCAAGACTGAAGAAGATGTCCCAGAAACTTCACCAATAATCAACACTTTTAAA 

AACCTTTCCTCTACCTCAGTTCTCTTATCATGGGATCCCCCAGTAAAGCCAAATGGTGCA 

ATAATAAGTTATGATTTAACTTTACAAGGACCAAATGAAAATTATTCTTTCATTACTTCT 

GATAATTACATAATATTGGAAGAGCTTTCACCATTTACATTATATAGCTTTTTTGCTGCC 

GCAAGAACTAGAAAAGGACTTGGTCCTTCCAGTATTCTTTTCTTTTACACAGATGAGTCA 

GTGCCGTTAGCACCTCCACAAAATTTGACTTTAATCAACTGTACTTCAGACTTTGTATGG 

CTGAAATGGAGCCCAAGTCCTCTTCCAGGTGGTATTGTTAAAGTATATAGTTTTAAAATT 

CATGAACATGAAACTGACACTATATATTATAAGAATATATCAGGATTTAAAACTGAAGCC 

AAACTTGTTGGACTGGAACCAGTCAGCACCTACTCTATCCGTGTATCTGCGTTCACCAAA 

GTTGGAAATGGCAATCAATTTAGTAATGTAGTAAAATTCACAACCCAAGAATCAGTTCCA 

GATGTCGTGCAGAATATGCAGTGCATGGCAACTAGCTGGCAGTCAGTTTTAGTGAAATGG 

GATCCACCCAAAAAGGCAAATGGAATAATAACGCAGTATATGGTAACAGTTGAAAGGAAT 

TCTACAAAAGTTTCTCCCCAAGATCACATGTACACTTTCATAAAGCTTCTTGCCAATACC 

TCATATGTCTTTAAAGTAAGAGCTTCAACCTCAGCTGGTGAAGGTGATGAAAGCACATGC 

CATGTCAGCACACTACCTGAAACAGTTCCCAGTGTTCCCACAAATATTGCTTTTTCTGAT 

GTTCAGTCAACTAGTGCAACATTGACATGGATAAGACCTGACACTATCCTTGGCTACTTT 

CAAAATTACAAAATTACCACTCAACTTCGTGCTCAAAAATGCAAAGAATGGGAATCCGAA 

GAATGTGTTGAATATCAAAAAATTCAATACCTCTATGAAGCTCACTTAACTGAAGAGACA 
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Pig. 7 (continued) 

GTATATGGATTAAAGAAATTTAGATGGTATAGATTCCAAGTGGCTGCCAGCACCAATGCT 
GGCTATGGCAATGCTTCAAACTGGATTTCTACAAAAACTCTGCCTGGCCCTCCAGATGGT 
CCTCCTGAAAATGTTCATGTAGTAGCAACATCACCTTTTAGCATCAGCATAAGCTGGAGT 
GAACCTGCTGTCATTACTGGACCAACATGTTATCTGATTGATGTCAAATCGGTAGATAAT 
GATGAATTTAATATATCCTTCATCAAGTCAAATGAAGAAAATAAAACCATAGAAATTAAA 
GATTTAGAAATATTCACAAGGTATTCTGTAGTGATCACTGCATTTACTGGGAACATTAGT 
GCTGCATATGTAGAAGGGAAGTCAAGTGCTGAAATGATTGTTACTACTTTAGAATCAGCC 
CCAAAGGACCCACCTAACAACATGACATTTCAGAAGATACCAGATGAAGTTACAAAATTT 
CAATTAACGTTCCTTCCTCCTTCTCAACCTAATGGAAATATCCAAGTATATCAAGCTCTG 
GTTTACCGAGAAGATGATCCTACTGCTGTCCAGATTCACAACCTCAGTATTATACAGAAA 
ACCAACACATTCGTCATTGCAATGCTAGAAGGACTAAAAGGTGGACATACATACAATATC 
AGTGTTTACGCAGTCAATAGTGCTGGTGCAGGTCCAAAGGTTCCGATGAGAATAACCATG 
GATATCAAAGCTCCAGCACGACCAAAAACCAAACCAACCCCTATTTATGATGCCACAGGA 
AAACTGCTTGTGACTTCAACAACAATTACAATCAGAAT 

GATCATGGACCAATAAAAAATGTACAAGTGCTTGTGACAGAAACAGGAGCTCAGCATGAT 
GGAAATGTAACAAAGTGGTATGATGCATATTTTAATAAAGCAAGGCCATATTTTACAAAT 
GAAGGCTTTCCTAACCCTCCATGTACAGAAGGAAAGACAAAGTTTAGTGGCAATGAAGAA 
ATCTAGATGATAGGTGCTGATAATGCATGCATGAT^ 

AATGGACCACTGAAACCAAAAAAGCAATACTTATTTAAATTTAGAGCTACAAATATTATG 
GGACAATTTACTGACTCTGATTATTCTGACCCTGTTAAGACTTTAGGGGAAGGACTTTCA 
GAAAGAACCGTAGAGATCATTCTTTCCGTCACTTTGTGTATCCTTTCAATAATTCTCCTT 
GGAACAGCTATTTTTGCATTTGCAAGAATTCGACAGAAGCAGAAAGAAGGTGGC^CATAC . 
TCTCCTCAGGATGCAGAAATTATTGACACTAAATTGAAGCTGGATCAGCTCATCACAGTG 
GCAGACCTGGAACTGAAGGACGAGAGATTAACGCGGTTACTTAGTTATAGAAAATCCATC 
AAGCCAATAAGCAAGAAATCCTTCCTGCAACATGTTGAAGAGC 

CTAAAGTTTCAAGAAGAATTTTCGGAATTACCAAAATTTCTTCAGGATCTTTCTTCAACT 
GATGCTGATCTGCCTTGGAATAGAGCAAAAAACCGCTTCCCAAACATAAAACCATATAAT 
AATAACAGAGTAAAGCTGATAGCTGACGCTAGTGTTCCAGGTTCGGATTATATTAATGCC 
AGCTATATTTCTGGTTATTTATGTCCAAATGAATTTATTGCTACTCAAGGTCCACTACCA 
GGAACAGTTGGAGATTTTTGGAAAATGGTGTGGGAAACCAGAGCAAAAACATTAGTAATG 
CTAACACAGTGTTTTGAAAAAGGACGGATCAGATGCCATCAGTATTGGCCAGAGGACAAC 
AAGCCAGTTACTGTCTTTGGAGATATAGTGATTACAAAGCTAATGGAGGATGTTCAAATA 
GATTGGACTATCAGGGATCTGAAAATTGAAAGGCATGGGGATTGCATGACTGTTCGACAG 
TGTAACTTTACTGCCTGGCCAGAGCATGGGGTTCCTGAGAACAGCGCCCCTCTAATTCAC 
TTTGTGAAGTTGGTTCGAGCAAGCAGGGCACATGACACCACACCTATGATTGTTCACTGC 
AGTGCTGGAGTTGGAAGAACTGGAGTTTTTATTGCTCTGGACCATTTAACACAACATATA 
AATGACCATGATTTTGTGGATATATATGGACTAGTAGCTGAACTGAGAAGTGAAAGAATG 
TGC^TGGTGCAGAATCTGGCACAGTATATCTTTTTACACCAGTGCATTCTGGATCTCTTA 
TCAAATAAGGGAAGTAATCAGCCCATCTGTTTTGTTAACTATTCAGCACTTCAGAAGATG 
GACTCTTTGGACGCCATGGAAGGTGATGTTGAGCTTGAATGGGAAGAAACCACTATG 
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Fig. 8 

tttactgctt aaaatctagg tgggtgtgag gccatgactg tgaacgttgt cagtcactga 
ctgtattcag ctgtaaaggc ttgacaaggt gagaaaaacc tcccacactt ttaaaatgaa 
tgaaatgttt gaacactaag tagtcataca caaaataatt accttaattt catatgttgt 
tccaggatta agattagtaa gaagaacttc cagactgtct ggctttgatc tgacttgtgt 
atatactgtt tgcatccacg gacaaacttc cttatatttg acaatgtaag atataatcct 
tccatttgga ttaggtggtg tattccaaga cagaagaatc ccagcagatc cgactctttc 
cccggctagg aagactggag gccctggttc tatttttaaa aaatccattt tg 



Pig. 9 

Q: 1 l^FLIIFLLLFIGXSEXQVDVSNVVPGXRYDIXISSISXX-YXSPVXRIVXXN^KPGPP 

MDF . FL.L.IG.SE.QVDVS: ..G. YDI.:SS:S.. Y. SPV.R. : . .NV.KPGPP 
H : 2 MDFHFS FLFLL IGTSESQVDVSSSFDGTGYDITLS S VSATTYSSPVSRTLATNVTKPGPP 

VFLAGERVGSAGILLSWNTPPNPNGRI IS YIVKYKEVCPWMQTVYTQVRSKPDSLEVLLT 
VFLAGERVGS AGI LLS WNTP PNPNGRI I S Y : VKYKEVCPWMQT . YT : . R : KPDSLEVLLT 
VFLAGERVGSAGILLSWNTPPNPNGRI I S YVVKYKEVCPWMQTAYTRARAKPDSLEVLLT 

NLNPGTTYE I KVAAENS AGI GVFSD PFLFQTAE S APGKWNLTVEAYNAS AVKL I WYLPR 
NLNPGTTYE I KVAAEN : AGIGWSDPFLFQTAESAPGKVVNLTVEA . N SAV. LI WYLPR 
NLNPGTTYE IKVAAENNAGIGVFSDPFLFQTAE S APGKVVNLTVEALNYSAVNLI WYLPR 

QPNGKITSFKISVKHARSGIWKDVSIRVEDILTGKLPESNENSESFLWSTASPSPTLGR 
QPNGKI TS FKI S VKHARSGI WKDVS : RVEDIL : GKLPE .NENSESFLWST . SPSPTLGR 
QPNGKITSFKISVKHARSGIVVKDVSLRVEDILSGKLPECNENSESFLWSTTSPSPTLGR 

VTPPSRTTHSS S TLTQNE I SS VWKEP I S F WTHLRP YTTYLFEVSAATTEAGYIDS TI VR 
VTP. RTT.SSST. . : : : ISS VWKEPISF WTHLRP YTTYLFEVSA. TTEAGYIDSTIVR 
VTPTVRTTQS S STAARSKI S S VWKE PIS FVVTHLRP YTTYLFEVSAVTTEAGYIDS TI VR 

TPESVPEGPPQNCVTGNITGKSFSILWDPPTIVTGKFSYRVELYGPSGRILDNSTKDLKF 
TPESVPEGPPQNC : . GN : TGK : FSI WDPPTIVTGKFSYRVELYGPSGRILDNSTKDL : F 
TPESVPEGPPQNCIMGNVTGKAFSISWDPPTIVTGKFSYRVELYGPSGRILDNSTKDLRF 

AFTNLTPFTMYDVY I AAETSAGTGPKSNI SVFTPPDVPGAVFDLQLAEVES TQVRI TWKK 
AFT : LTPFTMYDVY : AAETSAG . GPKSN : SVFTPPDVPGAVFDLQ : AEVE : T : : RITW : K 
AFTHLTPFTMYDVWAAETSAGVGPKSNLSWTPPDVPGAVFDLQIAEVEATEIRITWRK 

PRQPNGIINQYRVKVLVPETGIILENTLLTGNNEYINDPMAPEIVNIVEPMVGLYEGSAE 
PRQPNGII : QYRVKV V ETG: :LENTLLTG. :E I : :PM: PEI :N:V:PM:G. YEGS .E 
PRQPNGI I SQYRVKVSVLETGVVLENTLLTGQDES ISNPMSPEIMNLVDPMIGFYEGSGE 



MS SDLHSLATFI YNSHPDKNFPARNRAEDQTS P WTTRNQ YI TD I AAEQLS YVI RRLVPF 
MSSDLHS A : FIYNSHP . . : FPA . . RAE : Q : SP WTTRNQ Y : TDI . AEQLSYV : RRLVPF 
MSSDLHSPASFIYNSHPHNDFPASTRAEEQSSPWTTRNQYMTDITAEQLSYWRRLVPF 

TEHMI S VS AFTIMGEGP PTVLS VRTRQQVPS S IKI INYKNI S S S S ILLYWDP PE YPNGKI 
TEH . IS VSAFTIMGEGPPTVL : VRTR : QVPSSI : I INYKNI SSSSILLYWDPPE YPNGKI 
TEHTI S VS AFTIMGEGPPTVLTVRTREQVPSS IQI INYKNI SS SS ILLYWDPPE YPNGKI 

THYTIYAMELDTNRAFQITTIDNSFLITGLKKYTKYKMRVAASTHVGESSLSEENDIFVR 
THYTIYA . ELDTNRAFQ : TT : DNS FLI TGLKKYT : YKMRVAASTHVGESSIiSEENDIFVR 
TH YTI YATELDTNRAFQMTTVDNS FLITGLKKYTRYKMRVAASTHVGES SLSEEND I FVR 

TSEDEPESSPQDVEVIDVTADEIRLKWSPPEKPNGIIIAYEVLYKNIDTLYMKNTSTTDI 
T.EDEPESSPQDV:V. .V: . . E : RLKWSPPEKPNGIIIAYEVLY : N . DTL : : KNTSTTDI 
TPEDEPESSPQDVQVTGVSPSELRLKWSPPEKPNGI I IAYEVLYQNADTLFVKNTSTTDI 
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Fig. 9 (continued). 

ILRNLRPHTLYNISVRSYTRFGHGNQVSSLLSVRTSETVPDSAPENITYKDJISSGEIELS 
I : . : L : P : TLYNIS : RSYTR . GHGNQ SSLLSVRTSETVPDSAPENITYKNISSGEIE : S 
1 1 SDL KP YTLYNI S IRS YTRLGHGNQS S S LL S VRTS ETVPD S APENI TYKNT S SGE I E I S 

FLPPSSPNGIIQKYTIYLKRSNGNEERTINTTSLTQNIKVIiKKYTQYIIEVSASTLKGEG 
FLPP . SPNGIIQKYTIYLKRSN. : E . RTINTTSLTQ . I LKKYT . Y : IEVSASTLKGEG 
FLPPRSPNGIIQKYTIYLKRSNSHEARTINTTSLTQTIGGLKKYTHYVIEVSASTLKGEG 
VRSAPISILTEEDAPDSPPQDFSVKQLSGVTVKLSWQPPLEPNGIILYYTVYVWNRSSLK 
: RS . PISILTEEDAPDSPPQ : FSVKQLSGVTV. LSWQPPLEPNGIILYYTVYVW : : SSL : 
IRSRPISILTEEDAPDSPPQNFSVKQLSGVTVMLSWQPPLEPNGIILYYTVYVWDKSSLR 

TINVTETSLELSDLDYlsrVEYSAYVTASTRFGDGKTRSNIISFQTPEGAPSDPPKDVYYAN 
.IN.TB.SL LSDLDYNV : Y . A VTASTRFGDG . . RS : II : F : TPEG . PSDPP . DV : Y . N 
AINATEASLVLSDLDYNVDYGACVTASTRFGDGNARS S I INFRTPEGEPSDPPNDVHYVN 

LSS S S I ILFWTPPSKPNGI IQYYSVYYRNTSGTFMQNFTLHEVTNDFDNMTVSTI IDKLT 
LSSSSIILFWTPP KPNGIIQYYSVYY :NTSGTF :QNFTL :VT. : DN:TVS. I :L. 
LSSSSIILFWTPPVKPNGIIQYYSVYYQNTSGTFVQNFTLLQVTKESDNVTVSARIYRLA 

IFSYYTFWLTASTSVGNGNKSSDIIEVYTDQDIPEGFVGNLTYESISSTAINVSWVPPAQ 
IFSYYTFWLTASTSVGNGNKSSDII . VYTDQDIPEG VGNLT : ESISSTAI : VSW PP:Q 
IFSYYTFWLTASTSVGNGNKSSDIIHVYTDQDIPEGPVGNLTFESISSTAIHVSWEPPSQ 

PNGLVFYYVSLILQQTP -RHVRPPLVTYERS I YFDNLEKYTDYILKITPSTEKGFSDTYT 
PNGLVFYY : SL LQQ:P RH: PPLVTYE.SI FD : LEKYTDYI . KITPSTEKGFS : TYT 
PNGLVFYYLSLNLQQSPPRHMIPPLVTYENSIDFDDLEKYTDYIFKITPSTEKGFSETYT 

AQLYI KTEEDVPETS P I INTFKNLSSTS VLLS WDPPVKPNGAI I SYDLTLQGPNENYS F I 
. QL : IKTEEDVP : T . PIINTFKNLSSTS : LLSWDPP : KPNGAI : . Y . LTLQGP : . N : : F : 
TQLHIKTEEDVPDTPPI INTFKNLSSTS ILLSWDPPLKPNGAILGYHLTLQGPHANHTFV 

TSDNYIILEELSPFTLYSFFAAARTRKGLGPSSILFFYTDESVPLAPPQNLTLINCTSDF 
TS . N : I : LEELSPFTLYSFFAAART . KGLGPSSILFFYTDES . PLAPPQNLTLIN TSDF 
TSGNHIVLEELSPFTLYSFFAAARTMKGLGPSSILFFYTDESAPLAPPQNLTLINYTSDF 

WLKWSPSPLPGGIVKVYSFKIHEHETDTIYYKNISGFKTEAKLVGLEPVSTYSIRVSAF 
VWL.WSPSPLPGGIVKVYSFKIHEHETDT: : YKNISG . : T : AKL GLEPVSTYS : .VSAF 
VWLTWS PS PLPGGI VKVYS FKIHEHETDTVFYKNI SGLQTDAKLEGLEPVSTYS VS VS AF 



TKVGNGNQFSNVVTCFTTQESVPDWQNMQCm^ 

TKVGNGNQ : SNW : FTTQESVP : .V:N: :C:A. .WQSV V: WDPP : K.NGII . . YM : TV 
TKVGNGNQYSNVVEFTTQESVPEAVRNIECTARDWQSVSVRWDPPRK?IWGIIIHYMITVG 

RNSTK^SPQDHMYTFIKLLANTSYVFKVRASTSAGEGDESTCHVSTLPETVPSVPTNIAF 
NSTKVSP : D . YTF . KLL . NTSYVF : VRASTSAGEG : ES . C i : STLPETVPS . PTN'; AF 
GNSTKVSPRDPTYTFTKLLPNTSYVFEVRASTSAGEGNESRCDISTLPETVPSAPTNVAF 

SDVQSTSATLTWIRPDTILGYFQNYKITTQLRAQKCKEWESEECVEYQKIQYLYEAHLTE 
S : VQSTSATLTW . : PDTI . GYFQNYKI TTQLRAQKC : EWE . EEC : E : QK QYLYEA : TE 
SNVQSTSATLTWTKPDTIFGYFQNYKITTQLRAQKCREWEPEECIEHQKDQYLYEANQTE 

ETVYGLKKFRWYRFQVAMTNAGYGNASNWISTKTLPGPPDGPPENVHVVATSPFSISIS 
ETV : GLKKFRWYRFQVAASTN . GY . NAS . WIST : TLPGPPDGPPENVHWATSPF . I : IS 
ETVHGLKKFRWYRFQVAASTNVGYSNASEWISTQTLPGPPDGPPENVHWATSPFGINIS 
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Fig. 9 (continued) 

WS KPAVI TGPTCYL ID VKS VDNDEFNI S F I KSNEENKT I E IKDLE I FTRYS WITAFTGN 
WSEPAVITGPT YLIDVKSVD :D : FNISF : KSNEENKT . EI . : LE : FTRYSWITAF . GN 
WSEPAVITGPTFYLIDVKSVDDDDFNISFLKSNEENKTTEINNLEVFTRYSVVITAFVGN 

<ATP/GTP-binding site motif> 

I S AAYVEGKS S AEMI VTTLESAPKDPPNNMTFQKI PDEVTKFQLTFLPPS QPNGNI QVYQ 
: S . AY . : GKSSAE : I : TTLES . PKDPPNNMTFQKIPDEVTKFQLTFLPPSQPNGNI : VYQ 
VSRAYTDGKSSAEVIITTLESVPKDPPNNMTFQKIPDEVTKFQLTFLPPS QPNGNI RVYQ 

ALVYREDDPTAVQIHNLSI I QKTNTFVIAMLEGLKGGHTYNI S VYAVNS AGAG PKVPMRI 
ALVYREDDPTAVQIHN . S I IQKT : T : I AMLEGLKGGHTYNI SVYA : NSAGAGPKV . MRI 
ALVYREDDPTAVQIHNFSI IQKTDTSI IAMLEGLKGGHTYNISVYAINSAGAGPKVQMRI 

TMD I KAP ARPKTKPTP I YDATGKLL VTS TTI T I RMP I C YYSDDHGP I KNVQ VLVTE TGAQ 
TMDIKAPARPK : KP . PI D ATGKLLVT STT IT IRMP I C YY : DDHGPI : NVQVLV . ETGAQ 
TMDIKAPARPKSKPIPIRDATGKLLVTSTTITIRMPICYYNDDHGPIRNVQVLVAETGAQ 

HDGNVTK>OT)AYFNKARPYFTNEGFPNPPCTEGKTKFSGNEEIYIIGADNACMIPGNEDK 
. DGNVTKWYDAYFNKARPYFTNEGFPNPPC . EGKTKF S GNEE I Y : IGADNACMIPGNE : K 
QDGNVTKWYDAYFNKARPYFTNEGFPNPPCIEGKTKPSGNEEIYVIGADNACMIPGl^ 

< transmembrane , region > 
I OTGPLKPKKQYLFKFRATNIMGQFTDSDYSDPVKTLGEGLSERTVEI ILSVTLCILS 1 1 
ICNGPIiKPKKQYLFKFRATN : MGQFTDS : YSDP : KTLGEGLSERTVEI ILSVTLCILSI I 
ICNGPIiKPKKQYLFKFRATNVMGQFTDSEYSDPIKTLGEGLSERTVEIILSVTLCILSII 

LLGTAIFAFARIRQKQKEGGTYSPQDAEIIDTKLKLDQLITVADLELKDERLTR^ 

LLGTAIFAF.RIRQKQKEGGTYSP^AEIIDTK.IOjDQLITVACLELKDERLTRLLSYRK 

LLGTAIFAFVRIRQKQKEGGTYSPRDAEIIDTKFKLDQLITVADLELKDERLTRLLSYRK 

SIKPISKKSFLQHVEELCTNNNLKTQEEFSELPKFLQDLSSTDADLPWN^ 
SIKPISKKSFLQHVEELCTN : NLKFQEEFSELPKFLQDLSSTDADLPWNRAKNRFPNIKP 
SIKPISKKSFLQHVEELCTNSNLKFQEEFSEIiPKFIiQDLSSTDADLPWNRAKNRFPNIKP 

YNNNRVKLIADASVPGSDYINASYISGYLCPNEFIATQGPLPGTVGDFWKMVWETR^ 
YNNNRVKLIAD . S : PGSDYINASY : SGYLCPNEFIATQGPLPGTVGDFW : MVWETR . KTL 
YNimRVKLIADVSLPGSDYINASYVSGYLCPNEFIATQGPLPGTVGDFWRMVVJETRTKTIj 

VMLTQCFEKGRIRCHQYWPEDNKPVTVFGDIVITKLMEDVQIDWTIRDLKIERHGDCMW 
VMLTQCFEKGRIRCHQYWPEDNKPVTVFGDIVITKLMED : QIDWTIRDLKIERHGDCMTV 
VMLTQ CFEKGRIRCHQYWPEDNKPVTVFGDI VITKLMED IQ IDWT IRDLKIBRHGDCMTV 

<Tyrosine phosphatase signature: C is the active site residue> 
RQCNFTAWPEHGVPENSAPLIHFVTCLVTIASRAHDTC 

RQCNFT . WPEHGVPEN : . PLIHFVKLVR . SRAHDTTPM : VHC S AGVGRTGVFI ALDHLTQ 
RQCOTTGWPEHGVPENTTPLIHFVKLVRTSRAHDTTP 

HINDHDFTOIYGLVAELRSERMC^QNLAQYIFLHQCILDLLSNKGSNQPICFVWSALQ 
HIN : KDFVD I YGIiVAELRSERMCMVQNLAQYI FLHQC I LDLLSNKG . : QP : CFVNYS . LQ 
HINNHDFVD I YGLVAELRSERMCMVQNLAQYI FLHQCILDLLSNKGGHQPVCFVNYSTLQ 



KMDSLDAMEGDVELEWEETTM 
KMDSLDAMEGDVELEWEETTM 
KMDSLDAMEGDVELEWEETTM 



2299 
2302 
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Fig. 10 

Q: 1991 HVEELCTNIJNLK^QEEFSELPKFLQDLSSTDA 

::E.L . N : , LKF , : E : , . : , . Q. . : N: .KNR: .N; .Y:::RVL.: 

H: 8 NIERLKA1TOGLKFSQEYESIDPG-QQFTWENSNLEVNKPKNRYAWIAYDHSRVILTSID 

SVPGSDYINASYISGYLCPNEFIATQGPLPGTVGDFWKMVWETRAKTLVI^LTQCFEKGRI 
. VPGSDYINA : YI . GY . N . : I ATQGPLP T : GDFW : MWE . R . . T : VM : T : . EK.R: 
GVPGSDYINAl^lDGYRKQNAYIATQGPLPETMGDFWRMVWEQRTATVVMMTRIiEEKSRV 

RCHQ YWPEDNKPVTVFGD I VI TKLMEDVQ I -DWTIRDLKIERHG- - DCMTVRQCNFTAWP 
rC.QYWP. : . , . G I :T L : : . V : : . :T:R. . . : . : .G : . . :RQ .F.AWP 
KGDQYWPA- -RGTETCGLIQVT-LLDTVELATYTVRTFALHKSGSSEKRELRQFQFMAWP 

EHGVPENSAPLIHFVKLVRASRAHDTTPMIVHCSAGVGRTGVFIALDHLTQHINDHDFVD 

:HGVPE . .P:: F: : V:A. . . D. PM: VHCSAGVGRTG . FI . :D VD 

DHGVPEYPTPILAFLRRVKACOTLDAGPMVVHCSAGVGRTGCF^ 

IYGLVAELRSERMCMVQNLAQYIFLHQCIL^ 

IYG V. :RS:R MVQ. QY: F :H: , : L : . . : :. P. : Y: :QK: . . : . . E 
IYGHVTCMRSQRimiVQTEDQYVFIHEALLEAATCGHTEVPARNl-YAHIQKLGQVPPGE 

GDVELEWE 2295 
. . . :E E 

SVTAMELE 310 



WO 03/033688 



PCT/EP02/11473 



- 13/27 - 



Fig. 11 

Q: 2022 ADLPWNRAKNRFPNIKPYN^^ 

A: LP N: KNR. NI.PY. .RV h . . V . GSDYINAS : : . GY . . . : IATQGPL . 
H : 323 ANLPCNKFKNRLVNIMPYELTRVCLQPIRGVEGSDYINASFLDGYRQQKAYIATQGPLAE 

TVGDFWKMVWETRAKTLVMLTQCPEKGRIRCHQYWPEDNKPVTVFGDIVITKIiMED 

DFW:M:WE . : . . :VMLT: . E.GR :CHQYWP. : . . . : .V: . . : .E . . 
STEDFWRMLWEHNSTIIVMLTKLREMGREKCHQYWPAERS - -ARYQYFVVDPMAEYNMPQ 

WTIRDLKI - -ERHGDCMTVRQCNFTAWPEHGVPENSAPLIHFVKLVRASRAH- -DTTPMI 
:.:R:.K: .R.G...T:RQ . FT WPE . GVP : . . . .I.F: V..::.. p : . 

YILREFKVTDARDGQSRTIRQFQFTDWPEQGVPKTGEGFIDFIGQVHKTKEQFGQDGPIT 

VHCSAGVGRTGVFIALDHLTQHI1TOHDFVDIYGLVAELRSERMCMVQNLAQYIFLHQCIL 
VHCSAGVGRTGVFI . L . ...VD:: .V. .LR: :R .MVQ. QY . . : : . .L 

VHCSAGVGRTGVFITLSIVLEROTYEGVVDMFQTVKTLRTQRPA1WQTEDQYQLCYRAAL 

DLLSN 2262 
EYLGS 565 
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Fig. 12 

Q: 2027 NIUUCNRFPNIKPYNNWRVKLIadasvPGSDYINASYISG YLCPNEFIATQGP 

N: .KNR: . :I PY : : . RVKL . GSDYINA : YI . G Y P..:IATQGP 

H : 1 NkeKNRykdilpyDhtRVkLk egsDYINAnyidgpqsrt lvvykkpkaylaTQgP 

IiPGTVGDFWKMVWETRAKTLVMLTQCFEKGR- -IR-CHQYWPE DNKPVTVFGDIVIT 

LP.T: DFW:MVWE. : . . . rVMLT: . .EKGR : CQYWPE :. . . :GDI. :T 
IpnTieDFWrMvweqkvtvIVMLtklvEkgreeeKtCaqYWPeeggeeggsltyGditVt 

KLMEDVQIDWTIRDLKIERhGDC MTVRQCNFTAWPEHGVPEN- SAP 

:.E: . D:T:R.L : : . . GD .TV.. : : T . WP : HGVPE : ... 

lvseekdddytvRtlel tn . gdeskneakqeklkgetrtVthyhytgWPDhgvPesgpks 

LIHFVKLVRASRA HDTT PM I VHCS AGVGRTGVF I ALDHLTQHIN 

L:.F:: VR.S:: D: P: : VHCS AG : GRTG . FIA:D L.Q. : . 

HdflrkvrksqsqenesgaaspdsgprSsctaPivVHCSAGiGRTGtfiaidillqqle 

DHDFVDIYGLVAELRSERMCMVQNLAQYIFLHQCILD 2258 

. . .VDI: . :V.ELRS:R MVQ . . QYIF : : . . IL : 
aegwdifdivkelRsqRpgmVQteeQYiFiydaile 2 71 
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Fig. 13 

ATP_GTP_A region, 
region prediction. 



transmembrane segments, and fibronectin type III domain 



High scoring transmembrane segments: 
From 1906 to 1928: length= 23, score=69.00 ** 
1906 IILSVTLCIL SIILLGTAIF AFA 
L: 5(21.7%); A: 3(13.0%); I: 6(26.1%); 



Fibronectin type III domain region 
from 1641 to 1734. Source: [pf amsearch] 



from 153 8 to 1620. Source : 
from 1430 to 1526. Source : 
from 1340 to 1418. Source: 
from 1243 to 1328. Source: 
from 115 0 to 1231. Source: 
from 1053 to 1134. Source: 
from 948 to 1040. Source: 



from 854 to 93 5 
from 759 to 842 
from 665 to 746 
from 569 to 651 
from 397 to 558 
from 3 05 to 3 86, 
from 155 to 291. 
from 56 to 143. 



Source : 
Source : 
Source : 
Source : 
Source : 
Source : 
Source : 
Source : 



[pfamsearch] 
[pf amsearch] 
[pfamsearch] 
[pfamsearch] 
[pfamsearch] 
[pfamsearch] 
[pfamsearch] 



[pfamsearch] 
[pfamsearch] 
[pfamsearch] 
[pfamsearch] 
[pfamsearch] 
[pfamsearch] 
[pfamsearch] 
[pfamsearch] 
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Fig. 14 

AF063249_1 2 MDPHFSFLFLLIGTSESQ VDVSSSFD 

MDF + FL+L IGTSE+Q VDVS+ 

MDFL 1 1 FLLLFIGTSETQ VDVSNWP 

chrl2 : 88900001-32923 agtcaatctctagatgacGTATTTC Intron 1 TAGgggtaggc 

tattttttttttgccaca<0 [32977:33317] -0>tatcattc 

gtttctttatttgtagag tttctctt 

AF063249_1 28 GTGYD I TLS S VS ATTYS S PVSRTLATNVT 

GT YDIT+SS+S TTY+SPV+R + TNVT 

GTR YD ITISSIS- TT YTS PVTR I VTTNVT 
chrl2: 88900001-33342 gaatgaaattat aatatcgaaagaaaga 

gcgaatctcctc ccaccctcgttccatc 

ttgctacctatt aaccatttaagaataa 

AF063249_1 57 PGPPVFLAGERVGS AG I LLS WNT 

PGPPVFLAGERVGSAGILLSWNT 
K : K [aaa] PGPPVFLAGERVGSAGILLSWNT 
chrl2 : 88900001-33426 AGTGAGTA Intron 2 TAGAAcgccgtcgggaggtggaccttaa 

<1 [33427:34067] -1> cgcctttcgagtgccgtttcgac 

agtaccacgaacattgttgtgta 

AF06324 9JL 81 PPNPNGRIISYVVXYKEVCPWMQTAYTRARAKPDSLEVLLTNLNPGTTY 

PPNPNGRIISY+VKYKEVCPWMQT YT+ R+KPDSLEVLLTNLNPGTTY 
PPNPNGRI IS YI VKYKEVCPWMQTVYTQVRSKPDSLEVLLTNLNPGTTY 

chrl2 : 88900001-34139 ccacagaaattagataggtctacagtacgatacgacggccaacacgaat 

ccacaggttcattaaaatgcgtactacatgcacagtatttcatacgcca 
attatagtatctcatgattgggaaataacaagactgattttttttaaat 

AF063249_1 13 0 EIK VAAENNAGIGVFSDPFLFQTAES 

EIK VAAEN+AGIGVFSDPFLFQTAES 
EIK VAAENSAGIGVFSDPFLFQTAES 

chrl2 : 88900001-34286 gaaGTAATTA Intron 3 TAGggggaaggaggtagctctcagga 

ata<0 [34295 : 65280] -0>tccaagcgtgttgactttaccag 

atg tttacttctagtttatccataat 



AF063249_1 156 PGKVVNLTVKALNYS AVNL I WYL 

PGKWNLTVEA N SAV LI WYL 
A : A [gc t ] PGKVVNLTVEAYNASAVKL I WYL 

chrl2 : 88900001-65350 GGTAATTT Intron 4 CAGCTcgaggacagggtagtggacattt 

<1 [65351 : 66989] -1> cgattatctacaaccctattgat 

aaaggtcatgccctaatggtgta 



AF0 6324 9_1 180 PRQPNGKITS FKI S VKHARSGI WKDVSLRVED ILSGKLPECN 

PRQPNGKITS FKI SVKHARSG I VVKDVS +RVEDIL+GKLPE N 
PRQPNGKITS FKI S VKHARSGI VVKDVSIRVED I LTGKLPESN 

chrl2 : 8890 0001-67061 ccccagaaaataaagacgaagaggaggtaagggatagatcgaa 

cgacagatcgtatgtaacgggtttaatctgtaattcgatcaga 
tgaatcatcccgttcgtcatgaagatcacaagctgtgagaact 
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Fig. 14 (continued) 

AF06324 9__1 223 ENSESFLWSTTSPSPTLGRVTPTVRT 

ENSESFLWST SPSPTLGRVTP RT 
ENSESFLWSTASPSPTLGRVTPPSRT 

chrl2: 88900001-67190 GTAAGTA Intron 5 TAGgaagttttaagactcacgagacctca 

<0 [67190:68356] -0>aagacttggccgcccctggtccccgc 

gttattagtaccttacttatatagtc . 

AFO 63 24 9_1 249 TQSSSTAARSKISSWKEPISFVVTHLRPYTTYIjFEVSAVTTEAGYIDS 

T SSST + + + 1 S S VWKEP I S F WTHLRPYTTYL FEVS A TTEAGYIDS 
THSSSTLTQNEISSVWKEPISFVVTHLRPYTTYLFEVSAATTEAGYIDS 

chrl2 : 88900001-68435 acttaatacagaatgtagcaatggactactaatctggtggaagggtaga 

caccgctcaaatgctgaactgtttcatgcaccattatcccccacgatag 
ataacggagtgcctggagtcttagacgattaatttatattataattttt 



AF063249_1 298 TIVRTPES VPEGPPQNCIMGNVT 

TIVRTPES G N + 

TIVRTPES -:A[gcc]* . ^ TAAGTQANRVWSRPL 

chrl2 : 88900001-68582 aagaacgtGGTATGGT Intron 6 CAGCCagggacgaagtaacc 

cttgccac <1 [68607 : 70433] -1> cccgcacagtgggct 

gtcaaaaa ctttcgtcgcgtata 

AF063249_1 321 GKAFSISTOPPTIVTGKFSYRVELYGPSGRILDNSTKDLRFAFTHLTPF 

+ + + P ++ GK++ R +++ TK+ H P 

ANSNRPAAEGP J LLGGKLTNRKDIH- -. TKNPSVHHHHQRPK 

chrl2 ; 88900001-70481 gataacggggc2ctggacaaaagac aaactgcccccaca 

cacagcccagc ttggatcagaata caacctaaaaagca 
acccataggtt gaaaaaacagccc cacatatcttaaaa 



AF063249_1 370 TMYDVYVAAETSAGVGPKSNLSVFTPPDVP 

+ + S G S PP 

VDKTTKMGKKQSRKTGNSKKQSTSPPPKDG 

chrl2 : 88900001-70597 ggaaaaagaacaaaagataacaatcccagg 

taaccatgaaaggacgacaaagcccccaag 
atacagggaagcaatactaggcctttagtc 



AF0 63249_1 40 0 AVFDLQIAEVEATEIRITWRKPR 

AVFDLQ+AEVE+T++RITW+KPR 
G : R [agg] AVFDLQLAE VE STQVRI TWKKPR 

chr!2 :88900001-70687 AGTTCCTC Intron 7 CAGGGggtgtccggggtacgaaataacc 

<1 [70688:72736] -1> cttatatcataccatgtcgaacg 

agttaatagaacgaaattggaaa 



AF063249_1 424 QPNGI ISQYRVKVSVLETGWLENTLLTGQDE 

QPNGI I+QYRVKV V ETG++LENTLLTG +E 
QPNG I INQYRVKVL VPETG I ILENTLLTGNUE 

chrl2 : 88900001-72808 ccagaaactcgagcgcgagaatgaatcagaag 

acagttaaagtatttcacgtttaacttcgaaa 
aatattcacagagatagaaatgattgctattg 
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Pig. 14 (continued) 

AF063249_1 456 • SISNPMSPEIMNLVDPMIGFYEGSGE 

I++PM+PEI+N+V+PM+G+YEGS E 
YINDPMAPEIVNIVEPMVGLYEGSAE 
chr!2 : 88900001-72904 GTATTGC Intron 8 CAGtaagcagcgagaaggcaggttggtgg 

<0 [72904 : 84426] -0>ataactccattattacttgtaagcca 

tatccgtaatgcaagagaaatgtaag 

AF063249_1 482 MS SDLHSPASF I YNSHPHNDFPASTRAEEQS S PV 

MSSDLHS A+FIYNSHP +FPA RAE+Q+SPV 
MSSDLHSLATFIYNSHPDKNFPARNRAEDQTSPV 

chrl2 : 88900001-84505 attgcctcgatataaccgaatcgaaagggcatcg 

tccatactccttaagacaaatccgagcaaaccct 
ggtctcattatatcctatacttagtatacgtaat 

AF063249JL 516 TTRNQ YMTD I T AE QLS YWRRLV 

TTRNQY+TDI AEQLSYV+RRLV 
V: V [gta] TTRNQYITDIAAEQLSYVIRRLV 
chrl2 : 88900001-84607 GGTAGGTA Intron 9 CAGTAaaaactaagagggccttgaaacg 

<1 [84608 : 86084] -1> ccgaaatcatccaatcattggtt 

tagtgtttctataggtttcgata 



AF06324 9_1 54 0 PFTEHTISVSAFTIMGEGPPTVLTVRTREQ 

PFTEH ISVSAFTIMGEGPPTVL+VRTR+Q 
PFTEHMISVSAFTIMGEGPPTVLSVRTRQQ 

chrl2 : 88 900001-86156 ctagcaaagtgtaaagggccagcagaaccc 

ctcaattgtcctcttgagcccttgtgcgaa 
tctgcgttattcccgaaaaaatcttgatga 



AF063249_1 570 PSSIQIINYKNISSSSILLYWDP 

PSSI+IINYKNISSSSILLYWDP 
V:V[gtg] PSSIKIINYKNISSSSILLYWDP 
chrl2 :88900001-86246 GGTAAGGA Intron 10 AAGTGcataaaaataaaatttattttgc 

<1 t- [86247:86343] -1> cgctattaaaatgccctttagac 

acctatactattttattgatgtt 



AF063249_1 5 94 PE YPNGKITHYTI YATELDTNRAFQMTTVDNS FLI T 

PE YPNGKI THYT I YA ELDTNRAFQ+TT+DNSFLIT 
PEYPNGKITHYTIYAMELDTNRAFQITTIDNSFLIT 

chrl2 : 88900 001-86415 cgtcagaaactaatgagtgaaagtcaaaagaatcaa 

caacagatcaactactatacagctatcctaagtttc 
aatctaaatctgttagagtacaacgatcatcctcaa 



AF063249_1 630 - LKKYTRYKMRVAASTHVGESSLS 

LKKYT+ YKMRVAAS THVGE S S LS 
G : G [ggg] LKKYTKYKMRVAAS THVGE SSLS 

chrl2 : 88900001-86523 GGTAGAAA Intron 11 TAGGGtaataataaagggtacgggattt 

<1 [86524 : 87150] -1> taaacaaatgtccccatgagctc 

agacaacagagacacctaattgt 



WO 03/033688 



PCT/EP02/11473 



- 19/27 - 



Fig. 14 (continued) 

AF063249JL 654 EENDIFVRTPED PESSPQDVQVT 

EENDIFVRT ED PESSPQDV+V 
EENDIFVRTSED E : E [gaa] PESSPQDVEVI 

chr!2: 88900001-87222 ggagatgaatggGGTAAGAA Intron 12 TAGAAcgttccgggga 

aaaatttgccaa <1 [87259 : 87401] -1> cacccaatatt 

aatcctgataat gaaatatcaat 

r 

AFO 6 3 2 4 9JL 678 GVS PS ELRLKWS PPEKPNGI 1 I AYEVLYQNADTLFVKNTSTTD III SDL 

V+ E +RLKWS PPEKPNG 1 1 IAYEVL Y+N DTL++KNTSTTDII+ +L 
DVTADEIRLKWSPPEKPNGIIIAYEVLYKNIDTLYMKNTSTTDIILRNL 

chrl2 : 88900001-87437 ggagggaatattccgacagaaagtggctaaagattaaaataagaataat 

atccaatgtagcccaacagtttcaattaaatactataaccccatttgat 
ttcataaggggaacagctgcttttagatatattatggcaaaacaaagca 

AF06324 9_1 727 KPYTLYNISIRSYTRLGHGNQSSSLLSVRTSET 

+P+TLYNIS+RSYTR GHGNQ SSLLSVRTSET 
RPHTLYNISVRSYTRFGHGNQVSSLLSVRTSET 

chrl2 : 88900001-87584 accactaatgattaatgcgacgtttctgaatga 

gcactaatctgcacgtgagaatccttctgccac 
atccctcttagtccatttctgattactagtggt 

AF063249JL 760 PDSAPENITYKNISSGEIEISFL 

PDS APENI TYKNI S S GE IE+S FL 
V : V [gtg] PDSAPENITYKNISSGEIELSFL 
chrl2:88900001-87683 GGTGAGCT Intron 13 CAGTGcgagcgaaataaattggagcttc 

<1 [87684 : 97175] -1> cagccaatcaaatccgatatctt 

tttaaatctcattttagtgaact 

AF063249_1 784 PPRSPNGI IQKYTIYLKRSNSHEARTINTTSLTQTIG 

PP SPNGIIQKYTIYLKRSN +E RTINTTSLTQ I 
PPSSPNGIIQKYTIYLKRSNGNEERTINTTSLTQNIK 

chrl2 : 88900001-97247 ccaacagaacataatcaaaagaggaa&aaattacaaa 

ccggcagttaaactataggagaaagctaccctcaata 
cattctacaaatattcgattatgaatatactacacta 

AF063249JL 821 LKKYTHYVIEVSASTLKGEGIRS 

LKKYT Y+IEVSASTLKGEG+RS 
G : V [gta] LKKYTQYIIEVSASTLKGEGVRS 
chrl2: 88900001-97358 GGTAAAAG Intron 14 TAGTAcaatactaaggtgaacaggggca 

<1 [97359 : 97720] -1> taaacaattatccgctagagtgg 

ggatcatctggtttacataatgt 

AF063249_1 845 RPISILTEED PDSPPQNFSVKQL 

PISILTEED PDSPPQ+FSVKQL 
APISILTEED A: A [get] PDSPPQDFSVKQL 

chrl2:88900001-97792 gcaaacagggGGTAAATA Intron 15 CAGCTcgtcccgttgact 

cctgttcaaa <1 [97823 : 101475-1> cacccaatctaat 

tcatagggat tttctacctaagg 
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Fig. 14 (continued) 

AF063249_1 869 SGVTVMLSWQPPLEPNGIILYYTVYV 

SGVTV LSWQPPLEPNGI ILYYTVYV 

SGVTVKLSWQPPLEPNGI ILYYTVYV W : W [tgg] 

chrl2 : 88900001101517 tggagatttccccgcagaacttagtgTGGTAATAA Intron 16 

cgtctatcgacctacagtttaactat <2 [101597:125167 

t t cggggagaacggatatc ttcat tc 



AF063249J1 . 895 DKBSLRAINATEASLVLSDLDYNVDYGACVTASTRFGDGNARSSI 

++SSL+ IN TE SL LSDLDYNV+Y A VTASTRFGDG RS+I 
I^SSLKTIim^ETSLELSDIiDYNVEySAYVTASTRFGDGKTRSNI 

chrl2 : 88900001125165 CAGGaatttaaaagagattgttgtgtaggtagtgagaaatgggaaaaaa 

-2 > agcctactatcacctatcataaataagcatccgcgtgagacggat 
taaaaatttctaaaggaatgttttactttaatccatttgaaactc 



AF063249_1 . 941 INFRTPEG PSDPPNDVHYVNLSS 

I+F+TPEG PSDPP DV+Y NLSS 

ISFQTPEG E:A[gca] PSDPPKDVYYANLSS 

chrl2 : 88900001125304 aatcacggGGTGAGTT Intron 17 TAGCAcagccaggttgacat 

tgtaccag <1 [125329 : 12604-1> cgaccaataacatgc 

tctaaaga acttcattttacctt 



AFQ63249_1 965 SSI ILFWTPPVKPNG I IQYYS VYYQNTS GTFVQ 

SSIILFWTPP KPNGI IQYYSVYY+NTSGTF+Q 
SSI ILFWTPPSKPNGI IQYYSVYYRNTSGTFMQ 

chrl2 : 8890000112 6092 ttaacttacctacagaactttgttaaatgatac 

ccttttgccccacagttaaactaagaccgctta 
aaattcgattaattgtaatctttcattatttgg 



AF063249_1 998 NFTLLQVTKESDNVTVSARIYRLAIF 

NFTL +VT + DN+TVS I +L IF 
NFTLHEVTNDFDNMTVSTI IDKLTIF 

chrl2 : 88900001126191 GTAAGAA Intron 18 CAGataccggaagtgaaagtaaagacaat 

<0 [126191 : 13092-0>atctaatcaataatctccttaatctt 

ttactaactctctgtacatatagaac 



AF063249_1 1024 S YYTFWLTASTSVGNGNKS SD I IHVYTDQD 

S YYTFWLTASTSVGNGNKS SD 1 1 VYTDQD 
S YYTFWLTASTSVGNGNKS SD I IE VYTDQD 

chrl2 :88900001131007 attatttagaatggagaaaagaaggtagcg 

gaactgtccgcctgagaaggattatacaaa 
cctatgaaattatatgtactcctaacatac 



AF063249_1 . 1054 PEGPVGNLTFES I S STAIHVSWE 

PEG VGNLT+ESISSTAI+VSW 
I : I [ata] PEGFVGNLTYE SIS STAINVS WV 

chrl2: 88900001131097 AGTATGTA Intron 19 TAGTAcggtggacatgtattagaagatg 

<1 [131098 : 13270-1> cagttgatcaactcccctatggt 

tagttacgtcactgataatacgc 
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Fig, 14 (continued) 

AF063249_1 1078 P PS QPNGL VF YYLSLNLQQS PPRHMI PPLVTYENS IDFDDLEKYTD YI F 

PP+QPNGLVFYY+SL LQQ+P RH+ PPLVTYE SI FD+LEKYTDYI+ 
PPAQPNGLVFYYVSLILQQTP-RHVRPPLVTYERSIYFDNLEKYTDYIL 

chrl2 : 88900001132779 ccgccagcgtttgtcatccac ccgacccgatgaaattgacgatagtat 

cccacagtttaatctttaacc gatgccttcaaggtataataaacaatt 
agtaactacccttagcaggtt ctgaatttatgacattttgaactttaa 



AF063249JL 1127 KITPSTEKGFSETYTTQLHIKTEED 

KITPSTEKGFS+TYT QL+IKTEED 

KITPSTEKGFSDTYTAQLYIKTEED V : V [gtc] 

chrl2 :88900001132923 aaactagagttgatagcctaaagggGGTAGGCT Intron 20 

atccccaagtcacaccatatacaaa <1 [132999:133241 

attaaaagacttcttcgaccgtaat 



AF063249JL 1152 - PDTPPI INTFKNLS STS ILLSWDPPLKPNGAILG YHLTLQGPHA 

P+T PIINTFKNLSSTS+LLSWDPP+KPNGAI+ Y LTLQGP+ 
PETS P I INTFKNLS STS VLLS WDPPVKPNGAI I S YDLTLQGPNE 

chrl2 : 88900001133239 TAGTCcgatcaaaataacttatgctttgccgacaggaaatgtatcgcag 

-1> cacccttactaatcccctttcgacctacagcttgaatctagcaa 
aataaaccttactctcatcaagtcaagattaaatttataaaata 



AF063249_1 1197 NHTFVTS GNH I VLEELSPFTL YS FFAAARTMKGLGPS S I LFFYTDES 

N++F+TS N+I+LEELSPFTLYSFFAAART KGLGPSSILFFYTDES 
NYSFITSDNYIILEELSPFTLYSFFAAARTRKGLGPSSILFFYTDES 

chrl2: 88900001133376 atttaatgataatggctctattattgggaaaagcgctaactttaggt 

aacttccaaatttaatcctctagttcccgcgagtgccgttttacaac 
tttctttttcaagagtaataatctttcaataaatttctttctcatga 



AF063249JL 1244 PLAPPQNLTL INYTSDFVWLTWS 

PLAPPQNLTL IN TSDFVWL WS 
A : V [gtg] PLAPPQNLTL INCTSDFVWLKWS 

chrl2:88900001133517 GGTAAGCC Intron 21 CAGTGctgcccatataatatgtgtcata 

<1 -- [133518 : 13388-1> ctcccaatcttagccattgtagg 

gaataatgtaccttactaggagc 

AF063249_1 1268 PS PLPGGI VKVYS FKIHEHETDTVFYK 

PS PLPGGI VKVYS FKIHEHETDT+ + YK 

PSPLPGGIVKVYSFKIHEHETDTIYYK 
chrl2 : 88900001133954 cacccggagagtataacgcgagaattaGTAGGTT Intron 22 

cgctcggttat agtataaaacac taaa< 0 [134035:137708 

atttattttaatttattatatctattg 



AF 0 6 3 2 4 9 JL 1295 NISGLQTDAKLEGLEPVSTYS VS VSAFTKVGNGNQYSNWEFTTQE 

NISG +T+AKL GLEPVSTYS+ VS AFTKVGNGNQ+ SNW+ FTTQE 
NISGFKTEAKLVGLEPVSTYSIRVSAFTKVGNGNQFSNWKFTTQE 

chrl2 : 88900001137706 CAGaatgtaaggacggcgcgaattacgtgtaaggagactaaggataacg 

- 0 >at cgtacacattgtactgcactgtcctcatgagaatgattat ccaa 
taaatatacattagaacccctctatgccatatctatttaaacacaa 
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Fig. 14 (continued) 

AF063249_1 1341 S PEAvRNIECVARDWQSVSVRWD 

S P+ V+N++C+A WQSV V+WD 

S V:V[gtt] PDWQNMQCMATSWQSVLVKWD 

chrl2 :88900001137847 tGGTTAGAT Intron 23 AAGTTcgggcaactagaatctgtgatg 

c <1 [137851 :14061-1> cattaatagtccggactttaga 

a atcggtggcgatcggatagagt 

AF063249_1 . 1365 PPRKTNGI I IHYMI WGGNSTKVSPRDPTYTFTKLLPNTSYVFEVRAST 

PP+K NGII YM+TV NSTKVSP+D YTF KLL NTSYVF+VRAST 
PPKKANGIITQYMVTVERNSTKVSPQDHMYTFIKL^ 

chrl2 : 88900001140686 ccaagagaaactagaggaataagtccgcatataaccgaattgtagagta 

ccaacagttcaattctagaccatccaaatacttattcaccattatgccc 
acagataaaggtgaatagttaattcatcgctcagttctcatctaaatac 



AF063249_1 1414 S AGEGNE S RCD I S TLPET PSAPT 

SAGEG+ES C +STLPET PS PT 

S AGEGDE S TCHVS TLPET V;V[gtt] PSVPT 

chrl2: 88900001140833 tggggggaatcgaaccgaGGTAACTA Intron 24 TAGTTcagca 

ccgagaagcgatgctcac <1 [140888 : 17928-1> cgtcc 

attattacactccaataa cttca 



AF063249_1 1438 NVAFSNVQSTSATLTWTKPDTIFGYFQNYKITTQLRAQKCREWEPEECI 

N+AFS+VQSTSATLTW + PDT I +G YFQNYKI TTQLRAQKC +EWE EEC+ 
NIAFSDVQSTSATLTWIRPDTILGYFQNYKITTQLRAQKCKEWESEECV 

chrl2 :88900001179299 aagttggctaagatataacgaacgtt cataaaacccgcat agtgtggtg 

atctcataccgcctcgtgcacttgataaaatccatgcaagaagacaagt 
tttttttgattaagagaatctctcctatcatctatttaacaagacaatt 



EHQKDQ YLYEANQTEETVHGLKKFRWYRFQ VAAS TNVGYSNAS EWI STQ 
E+QK QYLYEA+ TEET V+GLKKFRWYRFQ VAAS TN GY NAS WIST+ 
E YQKI QYLYEAHLTEETVYGLKKFRWYRFQVAAS TNAGYGNAS1IW I STK 
gtcaactctggctaggagtgtaatattatcgggaaaggtgagtatataa 
aaaataataacatcaactagtaatggagtatccgcacgagaccagtcca 
ataatacctatcatagaataagatagtacagtcccttctcttacgttaa 



AF063249_1 1487 
Chrl2 : 88900001179446 



AF063249_1 1536 TLPG PDGPPENVHWATSPFGIN 

TLPG PDGPPENVHWATSPF 1+ 

TLPG P:P[cct] PDGPPENVHWATSPFSIS 

chrl2 : 88900001179593 a ccgCGTGAGTA Intron 25 TAGCTcggccgagcgggatctaaa 

ctcg <1 [179606 :19621-1> cagccaatattcccctgtg 

tgtc attttatttaaaaattccc 



AF063249_1 1560 ISWSEPAVITGPTFYLIDVKS VDDDD 

ISWSEPAVITGPT YLIDVKS VD+D+ 
ISWSEPAVITGPTCYLIDVKS VDNDE 

chrl2 : 88900001196271 aatagcggaagcattcaggatGTAAGGC Intron 26 TAGggagg 

tgggaccttcgccgattatac<0 [196334 : 20159-0>taaaa 

acgtattcttaaattgttcag attta 
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Fig. 14 (continued) 



AF063 24 9_1 1586 FNISFLKSNEENKTTE INNLEVFTRYS WITAFVGNVSRAYTDGKS SAE 

FNISF+KSNEENKT EI +LE+FTRYSWITAF GN+S AY +GKSSAE 
FNISFIKSNEENKTIEIKDLEIFTRYSWITAPTGNISAAYVEGKSSAE 

chrl2 : 88900001201607 taattaataggaaaagaagtgataattggaagtagaaaggtgggatagg 

tatcttacaaaaactataatattcgactttcctcgatgccatagacgca 
ttacccgataatacaatataaacagttagctattgctttataaggatta 

AF063 249_1 1635 VIITTLES PKDPPNNMTFQKIPD 

+I+TTLES PKDPPNNMTFQKIPD 
MIVTTLES V:A[gcc] PKDPPNNMTFQKIPD 

chrl2 :88900001201754 aagaatgtGGTAAGGA Intron 27 TAGCCcagccaaaatcaacg 

tttcctac <1 [201779:20474-1> caaccaatctaatca 

gttttaaa agcatccgatggaat 

AF063249_1 165 9 EVTKFQLTFLPPSQPNGNIRVYQALVteREDDPTAVQIHNFSIIQKTDTS 

EVTKFQLTFLPPS QPNGNI +VYQALVYREDDPTAVQIHN+S I IQKT+T 
EVTKFQLTFLPPSQPNGNIQVYQALVYREDDPTAVQIHNLSIIQKTNTF 

chrl2 : 88900001204792 ggaatctatccctccagaacgtcgcgtcgggcaggcaeacaaacaaaat 

atcatatcttcccacagatataacttagaaaccctataatgttaacact 
ataataagcttttattatcaatatgtcaatttttcgtcccttagaccac 

AF063 24 9_1 1708 1 1 AMLEGLKGGHTYNI S VYAINSAGA 

+ 1 AMLEGLKGGHTYNI S VYA+NSAGA 
VI AMLEGLKGGHTYNI S VYAVNSAGA 

chrl2 : 88900001204939 gagacggcaggcataaaGTAAGAA Intron 28 TAGgtggaaggg 

1 1 ctt agt aggacaatg< 0 - - [204990:20735-0 > tac tagcgc 

ctagaaaaatatactct tcactttta 



AF063249_1 1734 GPKVQMRITMDIK PARPKSKPIP 

GPKV MRITMDIK PARPK+KP P 

GPKVPMRITMDIK A: A [get] PARPKTKPTP 

chrl2 :88900001207380 gcagcaaaaagaaGGTACATA Intron 29 TAGCTcgccaaacac 

gcatctgtctata <1 [207420 :21053-1> ccgcacaccc 

tagtggaacgtca aaaaacaact 

AF063249JL 1758 IRDATGKLLVTSTTITIRMPI C YYNDDHGP IRNVQVLVAETG 

I DATGKLLVTSTTITIRMPI CYY+DDHGPI +NVQVLV ETG 
- I YDATGKLLVTSTTI T IRMP I CYYSDDHGP I KNVQ VLVTETG 

chr i2 :88900001210569 atggagac cgat aaaaaaacatt t aggcgcaaagcgcgagag 

taaccgatttcccctctgtctgaagaaagctaatatttcacg . 
tttcaaagtgtaaatacagaatccttttaaaataagtgaaaa 



AF063249_1 1800 QQDGNVTKWYDAYFNKARPYFTN 

Q DGNVTKWYDAYFNKARPYFTN 
A: A [get] QHDGNVTKWYDAYFNKARPYFTN 
chrl2: 88900001210695 GGTATCAT Intron 30 CAGCTccggagaattggttaagacttaa 

<1 [210696: 21130-1> aaagatcagaacataacgcatca 

gttataaggttatttaagattat 
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Fig. 14 (continued) 

AF063249_1 1824 EGFPNPPCIEGKTKFSGNEEIYVIGADNACMIPGNEEKICNGPLKPKKQ 

EGPPNPPC EGKTKFSGNEE I Y+ IGADNACMIPGNE +KI CNGPLKPKKQ 
EGPPNPPCTEGKTKFSGNEE I YI IGADNACMIPGNEDKI CNGPLKPKKQ 

chrl2 : 88900001211378 ggtcacctaggaaatagaggataagggagtaacgaggaatagccacaac 

agtcaccgcagacatggaaatattgcaacgttcgaaaatgagctacaaa 
acttctataaagagttctaacccattttacgttctacatctaagaaaga 

AF063249_1 1873 Y FKFRATNVMGQFTDSEYSDPIK 

Y FKFRATN+MGQFTDS+YSDP+K 

Y L : L [ tta] FKFRATNIMGQFTDSDYSDPVK 
chrl2: 88900001211525 tTTGTAAGTA Intron 31 TAGAtatagaaaagctagtgttgcga 

a <2 [211530:21321-2> tatgccattgatcacaacacta 

tatatattgaattcttttcttg 



c 



AF063249JL 1897 TL EGLSERTVEIILSVTLCILSI 

TL EGLSERTVEI ILSVTLCILSI 

TL G:G[ggg] EGLSERTVEIILSVTLCILSI 

chrl2: 88900001213281 atGGTAAGAC Intron 32 CAGGGggctgaaggaactgattacta 

ct <1 [213288 :22334-l> agtcagctatttctctgttct 

ta aataaacagcttcctgtctaa 

AF063249JL 1921 I LLGTAI F AFV IRQKQKEGGTYS 

ILLGTAIFAF IRQKQKEGGTYS 
I LLGTAI FAFA R : R [aga] IRQKQKEGGTYS 

chrl2: 88900001223410 accgagatgtgAGGTAAGAT Intron 33 TAGAaccacagggatt 

tttgccttctc <2 [223445 :22609-2> tgaaaaaggcac 

tctaatttata tagggaatcact 

AF063249_1 1945 PRDAEIIDTKFKLDQLITVADLELKDERLT 

P+DAEIIDTK+KLDQLITVADLELKDERLT 

PQDAEIIDTKLKLDQLITVADLELKDERLT 
chrl2 : 88900001226135 ccgggaagaatacgccaagggcgcaggata 

caacattacatataattctcatataaagtc 

tgtaattctagggtgccagacgaggcgaag 

AF063249JL 1975 LLSYRKSI 

LLSYRKSI 
R:R[cgg], LLSYRKSI 
Chrl2: 88900001226225 CGGTGAGCA Intron 34 CAGGtcataata 

<2 [226227:24004-2> ttgagact 

atttaacc 

AF063249_1 1984 PIS KKS FLQHVEELCTNSNLKFQ 

PISKKSFLQHVEELCTN+NLKFQ 
K : K [aag] PIS KKS FLQHVEELCTNNNLKFQ 

chrl2: 88900001240075 AAGTAAGTT Intron 35 GAGGcaaaattcccgggctaaaacatc 

<2 [240077 :24074-2> ctgaacttaataatgcaaatata 

aacgaccgattagtcacccagta 



WO 03/033688 



PCT/EP02/11473 



- 25/27 - 



Fig. 14 (continued) 

AF063249JL 2008 EEFS ELPKFLQDLS STDADLPWNRAK 

EEFS ELPKFLQDLS STDADLPWNRAK 

EEFS ELPKFLQDLS STDADLPWNRAK 

chrl2: 88900001240811 ggttGTATGTT Intron 36 CAGgtcatccgcttagggcctaaga 

aatc<0 [240823 :24389-0>atcattaatcccacatcgagca 

a atg aaaattgtttattttgtgtaaa 

AF063249_1 2034 NRFPNIKP NNNRVKLIADVSLPG 

NRFPNIKP NNNRVKLIAD S+PG 

NRFPNIKP " Y:Y[tat] NNNRVKLIADASVPG 

chrl2 : 88900001243963 a c t c a aa cTGTATGTG Intron 37 TAGATaaaagacagggagcg 

agtcatac <1 [243988 : 24912-1> aaagtattcacgtcg 

cccacaaa ttcaaggatctttat 

AF063249_1 2058 SDYINASYVS GYLCPNEFIATQGPLP 

SDYINASY+S GYLCPNEFIATQGPLP 
SDYINASYIS GYLCPNEFIATQGPLP 

chrl2:88900001249173 tgtaagatatGTAAGTT Intron 38 TAGgtttcagtagacgccc 

caatacgatc<0 [2492 03:260 14-0 >gatgcaattccagctc 

gttttccttt ttatatattttataaa 

AF063249_1 2084 GTVGDFWRJWWETRTKTLVMLTQCFEKGR 

GTVGDFW+MVWETR KTLVMLTQCFEKGR 

GTVGDFWKMVWETRAKTLVMLTQCFEKGR 
chrl2 : 88900001260196 gagggttaagtgaagaatgacacttgagcGTAAGTT Intron 39 

gctgatgattgacgcacttttcagtaagg<0 [260283 : 260479 

aatattgagggacaaaaaagaagttaaag 

AF0 6 3 2 4 9__1 2113' IRCHQYWPEDNKPVTVFGDI VITKLMEDIQIDWTIRDLKIER 

IRCHQYWPEDNKPVTVFGDIVITKLMED+QIDWTIRDLKIER 
IRCHQYWPEDNKPVTVFGDIVITKLMEDVQIDWTIRDLKIER 

chrl2 : 88900001260477 CAGaatccttcggaacgagtggagaaacagggcagtaaagcaaga 

-0>tggaaagcaaaactcttgatttcattaatatagctgatatag 
cactgtgagccgattctatagtagaggttaatgtcgtgatag 

AF06324 9_1 2155 HGDCMTVRQCNFTGWPEHGVPENTTP 

HGDCMTVRQCNFT WPEHGVPEN+ P 
HGDCMTVRQ'CNFTAWPEHGVPENSAP 

chrl2 : 88900001260606 GTAAAAA Intron 40 TAGcggtaagcctatagtcgcggcgaagc 

<0 [260606:26148 - 0 >agagtc tgagat ccgcaagtcaagcc 

tgtcgttagtcttcgagtgttgccct 

AF063249_1 2181 LIHFVKLVRTSRAHDTTPMWHC 

LIHFVKLVR * SRAHDTTPM+VHC 

LIHFVKLVRASRAHDTTPMIVHC S : S [ agt ] 

chrl2 : 88900001261563 cactgatgcgaagcgaacaagctAGGTGAGAA Intron 41 TAGT 

ttattattgcggcaaccctttag <2 [261634 : 26430-2> 

atctgggtaacgatccatgttcc 
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Fig. 14 (continued) 

AF06324 9_1 2205 AGVGRTGVF IALDHLTQHINNHDPVDI YGLVAELRSERMCMVQNL 

AGVGRTGVFIALDHLTQHIN+HDFVDIYGLVAELRSERMCMVQNL 
AGVGRTGVF I ALDHLTQH INDHD FVD I YGL VAELRS ERM CMVQNL 

chrl2 : 88900001264311 ggggaaggtagcgctaccaagcgtggabgcgggcaagaatagcac 

cgtggcgtttctaatcaataaaattatagttcatggagtgttaat 
tataatattttgctaaatatctttgtataaatagataagcgggtg 

AF06324 9_1 2250 AQYIFLHQCILDLLSNKGGHQPVCFV 

AQYIFLHQCILDLLSNKG +QP+CFV 
AQYI FLHQC ILDLL SlNTKGSNQP I CFV 

chrl2 : 88900001264446 GTAAGAT Intron 42 TAGgctattcctacgcttaagaaccattg 

<0 [264446 : 26974 -0>caatttaagttattcaaggaactgtt 

agtctacgctgtcaatgattgccttt 

AF063249_1 2276 NYSTLQKMDSLDAME DVELEWEE 

NYS LQKMDSLDAME DVELEWEE 
NYSALQKMDSLDAME G:G[ggt] DVELEWEE 

chrl2 : 8890000 attgccaagttggagGGTAAACA Intron 43 TAGGTgggcgtgg 

aacctaatactacta <1 [314473 : 31473 -1> atatagaa 

ctaatgggctgccga ttgtagaa 

AF063249 2300 TTM 

TTM 
TTM 

chrl2 : 8890000 aaa 

cct 
ctg 
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Fig. 15 

492_BAC__p 1 ENS E S FLWS TTS PS PTLGRVTPTVRTTQ S S S TAARS KI S S VWKEP I S F V 

ENSESFLWST SPSPTLGRVTP RTT SSST +++ISSVWKEPISFV 
ENSESFLWSTASPSPTLGRVTPPSRTTHSSSTLTQNEISSVWKEPISFV 

BAC 29570 gaagttttaagactcacgagacctcaacttaatacagaatgtagcaatg 

aagacttggccgcccctggtccccgccaccgctcaaatgctgaactgtt 
gttattagtaccttacttatatagtcataacggagtgcctggagtctta 

492_BAC_p 50 OTHLRPYTTYLFEVSAVTTEAGYIDSTIVRTPES 

VTHLRPYTTYLFEVSA TTEAGYIDSTIVRTPES 
VTHLRPYTTYLFEVSAATTEAGYIDSTIVRTPES 

BAC 29717 gactactaatctggtggaagggtagaaagaacgt 

tcatgcaccattatcccccacgatagcttgccac 
gacgattaatttatattataatttttgtcaaaaa 

492_BAC_J> 84 VPEGPPQNCIMGNVTGKAFSISV7DPPTIVTGKFSYRVELYGPS 

VPEGPPQNC+ GN+TGK+FSI WDPPTIVTGKFSYRVELYGPS 
VPEGPPQNCVTGNITGKSFSILWDPPTIVTGKFSYRVELYGPS 

BAC -16606 gcggcccatgagaaagatttattgccaagagatataggttgct 

tcagccaagtcgatcgactcttgacccttcgatgagtatagcc 
gtaaaaaccaacccaagctatagcaataaagatttataataaa 

492_BAC_J) 127 RILDNSTKDLRFAFTHLTPFTMY 

RILDNSTKDL+FAFT+LTPFTMY 
G : G [ggt] RILDNSTKDLKFAFTNLTPFTMY 
BAC -16477 GGTAAGCC Intron 1 CAGGTcatgaaaagcatgtaacactaat 

<1 [16476 : 13211] -1> gttaagcaatatctcatcctcta 

ctgtccaaccgtactcaaatagt 

492JBAC_p 151 DVYVAAETSAGVGPKSNLSVFTPPD 

DVY+AAETSAG GPKSN+SVFTPPD 

DVYIAAE TSAGTGPKSNI SVFTPPD V : V [gt t ] 

BAC -13139 ggtagggaaggagcataatgtaccgGGTAAGAA Intron 2 

atatccacgcgcgcacatcttccca <1 [13063: 883] 

tcttgtactagtgcgattaactaat 

492_BAC _p 176 PG AVFDLQ I AEVE ATE IRI T WRKPRQPNG IIS Q YRVKVS VLETG 

PGAVFDLQ+AEVE+T++RITW+KPRQPNGII+QYRVKV V ETG 
PGAVFDLQLAEVESTQVRITWKKPRQPNGI INQYRVKVLVPETG 

BAC -885 TAGTTcgggtgtccggggtacgaaataaccccagaaactcgagcgcgag 

- 1> cgcttatatcataccatgtcgaacgacagttaaagtatttcacg 
agagttaatagaacgaaattggaaaaatattcacagagatagaa 

492_BACjp 221 WLENTLLTGQDE 

++LENTLLTG +E 

IILENTLLTGNNE 
BAC -748 aatgaatcagaag 

tttaacttcgaaa 

atgattgctattg 
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SEQUENCE LISTING 

<110> Bayer AG 

<120> REGULATION OF HUMAN RECEPTOR TYROSINE PHOSPHATASE 

<130> Lio395 

<150> USSN 60/329 , 329 

<151> 2001-10-16 

<160> 8 

<170> Patentln version 3.1 

<210> 1 

<211> 6897 

<212> DNA 

<213> Homo sapiens 



<400> 1 

atggattttc ttatcatttt tcttttactt tttattggga cttcagagac acaggttgat 60 

gtttccaatg tcgttcctgg tactaggtac gatataacca tctcttcaat ttctacaaca 120 

tacacctcac ctgttactag aatagtgaca acaaatgtaa caaaaccagg gcctccagfcc 180 

ttcctagccg gggaaagagt cggatctgct gggattcttc tgtcttggaa tacaccacct 240 

aatccaaatg gaaggattat atcttacatt gtcaaatata aggaagtttg tccgtggatg 300 

caaacagtat atacacaagt cagatcaaag ccagacagtc tggaagttct tcttactaat 360 

cttaatcctg gaacaacata tgaaattaag gttgctgctg aaaacagtgc tggcattgga 420 

gtgtttagtg atccatttct cttccaaact gaagaaagtg ctccaggaaa agtggtgaat . 480 

ctcacagttg aggcctacaa cgcttcagca gttaagctga tttggtattt acctcggcaa 540 
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ccaaatggca aaattaccag cttcaagatt agtgtcaagc atgccagaag tgggatagta 600 

gtgaaagatg tctcaatcag agtagaggac attttgactg ggaaattgcc agaaagcaat 660 

gagaatagtg aatctttttt atggagtaca gccagccctt ctccaaccct tggtagagtt 720 

acacctccat cgcgtaccac acattcatca agcacgttga cacagaatga gatcagctct 780 

gtgtggaaag agcctatcag ttttgtagtg acacacttga gaccttatac aacatatctt 840 

tttgaagttt cagctgctac aactgaagca ggttatattg atagtacgat tgtcagaaca 900 

ccagaatcag tgcctgaagg accaccacaa aactgcgtaa caggcaacat cacaggaaag 960 

tccttttcaa ttttatggga cccaccaact atagtaacag ggaaatttag ttafcagagtt 1020 

gaattatatg gaccatcagg tcgcattttg gataacagca caaaagacct caagtttgca 1080 

ttcactaacc taacaccatt tacaatgtat gatgtctata ttgcggctga aaccagtgca 1140 

gggactgggc ccaagtcaaa tatttcagta ttcactccac cagatgttcc aggggcagtg 1200 

tttgatttac aacttgcaga ggtagaatcc acgcaagtaa gaattacttg gaagaaacca 1260 

cgacaaccaa atggaattat taaccaatac cgagtgaaag tgctagttcc agagacagga 1320 

ataattttgg aaaatacttt gctcactgga aataatgagt atataaatga ccccatggct 1380 

ccagaaattg tgaacatagt agagccaatg gtaggattat atgagggttc agcagagatg 1440 

tcgtctgacc ttcactcact tgctacattt atatataaca gccatccaga taaaaacttt 1500 

ccfcgcaagga atagagctga agaccagact tcaccagttg taactacaag gaatcagtat 1560 

attactgaca ttgcagctga acagctgtct tatgttatca ggagacttgt accttfccact 1620 

gagcacatga ttagtgtatc tgctttcacc atcatgggag aaggaccacc aacagttctc 1680 

agtgttagga cacgtcagca agtgccaagc tccattaaaa ttataaacta taaaaatatt 1740 

agttcttcat ctattttgtt atattgggat cctccagaat atcccaatgg aaaaataact 1800 

cactatacga tttatgcaat ggaattggat acaaacagag cattccagat aactaccata 1860 

gataacagct ttctcataac agggttaaag aaatacacaa aatacaaaat gagagtggca 1920 

gcctcaaccc acgttggaga aagttctttg tctgaagaaa atgacatctt tgtgagaact 1980 

tcagaagatg aaccggaatc atcacctcaa gatgtcgaag taattgatgt taccgcagat 2040 

gaaataaggt tgaagtggtc accacccgaa aagcccaatg ggatcattat tgcttatgaa 2100 

gtgctatata aaaatataga tactttatat atgaagaaca catcaacaac agacataata 2160 

ttaaggaact taagacctca caccctctat aacatttctg taaggtctta caccagattt 2220 

ggtcatggca atcaggtatc ttctttactc tctgtaagga cttcggagac tgtgcctgat 2280 

agtgcaccag aaaatatcac ttacaaaaat atttcttctg gagagattga gctafccattc 2340 



WO 03/033688 PCT/EP02/1 1473 



-3 



cttcccccaa gtagtcccaa tggaatcata caaaaatata caatttatct caagagaagt 2400 
aatggaaatg aggaaagaac tataaataca acctctttaa cccaaaacat taaagtactg 2460 

aagaaatata cccaatatat cattgaggtg tctgctagta cactcaaagg tgaaggagtt 2520 

cggagtgctc ccataagtat actgacggag gaagatgctc ctgattctcc ccctcaagac 2580 

ttctctgtaa aacagttgtc tggtgtcacg gtgaagttgt catggcaacc acccctggag 2640 

ccaaatggaa ttatccttta ttacacagtt tatgtctgga atagatcatc attaaaaact 2700 

attaatgtca ctgaaacatc attggagtta tcagatttgg attataatgt tgaatacagt 2760 

gcttatgtaa cagctagcac cagatttggt gatgggaaaa caagaagcaa tatcattagc 2820 

tttcaaacac cagagggagc accaagcgat cctcccaaag atgtttatta tgcaaacctc 2880 

agttcttcat caataattct tttctggaca cctccttcaa aacctaatgg gattatacaa 2940 

tattactctg tttattacag aaatacttca ggtactttta tgcagaattt tacactccat 3000 

gaagtaacca atgactttga caatatgact gtatccacaa ttatagataa actgacaata 3060 

ttcagctact atacattttg gttaacagca agtacttcag ttggaaatgg gaataaaagc 3120 

agtgacatca ttgaagtata cacagatcaa gacatacctg aagggtttgt tggaaacctg 3180 

acttacgaat ccatttcgtc aactgcaata aatgtaagct gggtcccacc ggctcaacca 3240 

aacggtctag tcttctacta tgtttcactg atcttacagc agactcctcg ccatgtgaga 3300 

ccacctcttg ttacatatga gagaagcata tattttgata atctggaaaa atacactgat 3360 

tatatattaa aaattactcc atcaacagaa aagggattct ctgataccta tactgcccag 3420 

ctatacatca agactgaaga agatgtccca gaaacttcac caataatcaa cacttttaaa 3480 

aacctttcct ctacctcagt tctcttatca tgggatcccc cagtaaagcc aaatggtgca 3540 

ataataagtt atgatttaac tttacaagga ccaaatgaaa attattcttt cattacttct 3600 

gataattaca taatattgga agagctttca ccatttacat tatatagctt ttttgctgcc 3660 

gcaagaacta gaaaaggact tggtccttcc agtattcttt tcttttacac agatgagtca 3720 

gtgccgttag cacctccaca aaatttgact ttaatcaact gtacttcaga ctttgtatgg 3780 

ctgaaatgga gcccaagtcc tcttccaggt ggtattgtta aagtatatag ttttaaaatt 3840 

catgaacatg aaactgacac tatatattat aagaatatat caggatttaa aactgaagcc 3900 

aaacttgttg gactggaacc agtcagcacc tactctatcc gtgtatctgc gttcaccaaa 3960 

gttggaaatg gcaatcaatt tagtaatgta gtaaaattca caacccaaga atcagttcca 4020 

gatgtcgtgc agaatatgca gtgcatggca actagctggc agtcagtttt agtgaaatgg 4080 
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gatccaccca aaaaggcaaa tggaataata acgcagtata tggtaacagt tgaaaggaat 4140 
tctacaaaag tttctcccca agatcacatg tacactttca taaagcttct tgccaatacc 4200 
tcatatgtct ttaaagtaag agcttcaacc tcagctggtg aaggtgatga aagcacatgc 4260 
catgtcagca cactacctga aacagttccc agtgttccca caaatattgc tttttctgat 4320 
gttcagtcaa ctagtgcaac attgacatgg ataagacctg acactatcct tggctacttt 4380 
caaaattaca aaattaccac tcaacttcgt gctcaaaaat gcaaagaatg ggaatccgaa 4440 
gaatgtgttg aatatcaaaa aattcaatac ctctatgaag ctcacttaac tgaagagaca 4500 
gtatatggat taaagaaatt tagatggtat agattccaag tggctgccag caccaatgct 4560 
ggctatggca atgcttcaaa ctggatttct acaaaaactc tgcctggccc tccagatggt 4620 
cctcctgaaa atgttcatgt agtagcaaca tcacctttta gcatcagcat aagctggagt 4680 
gaacctgctg tcattactgg accaacatgt tatctgattg atgtcaaatc ggtagataat 4740 

gatgaattta atatatcctt catcaagtca aatgaagaaa ataaaaccat agaaattaaa 4800 

gatttagaaa tattcacaag gtattctgta gtgatcactg catttactgg gaacattagt 4860 

gctgcatatg tagaagggaa gtcaagtgct gaaatgattg ttactacttt agaatcagcc 4920 

ccaaaggacc cacctaacaa catgacattt cagaagatac cagatgaagt tacaaaattt 4980 

caattaacgt tccttcctcc ttctcaacct aatggaaata tccaagtata tcaagctctg 5040 

gtttaccgag aagatgatcc tactgctgtc cagattcaca acctcagtat tatacagaaa 5100 

accaacacat tcgtcattgc aatgctagaa ggactaaaag gtggacatac atacaatatc 5160 

agtgtttacg cagtcaatag tgctggtgca ggtccaaagg ttccgatgag aataaccatg 5220 

gatatcaaag ctccagcacg accaaaaacc aaaccaaccc ctatttatga tgccacagga 5280 

aaactgcttg tgacttcaac aacaattaca atcagaatgc caatatgtta ctacagtgat 5340 

gatcatggac caataaaaaa tgtacaagtg cttgtgacag aaacaggagc tcagcatgat 5400 

ggaaatgtaa caaagtggta tgatgcatat tttaataaag caaggccata ttttacaaat 5460 

gaaggctttc ctaaccctcc atgtacagaa ggaaagacaa agtttagtgg caatgaagaa 5520 

atctacatca taggtgctga taatgcatgc atgattcctg gcaatgaaga caaaatttgc 5580 

aatggaccac tgaaaccaaa aaagcaatac ttatttaaat ttagagctac aaatattatg 5640 

ggacaattta ctgactctga ttattctgac cctgttaaga ctttagggga aggactttca 5700 

gaaagaaccg tagagatcat tctttccgtc actttgtgta tcctttcaat aattctcctt 5760 

ggaacagcta tttttgcatt tgcaagaatt cgacagaagc agaaagaagg tggcacatac 5820 

tctcctcagg atgcagaaat tattgacact aaattgaagc tggatcagct catcacagtg 5880 
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gcagacctgg aactgaagga cgagagatta acgcggttac ttagttatag aaaatccatc 5940 

aagccaataa gcaagaaatc cttcctgcaa catgttgaag agctttgcac aaacaacaac 6000 

ctaaagtttc aagaagaatt ttcggaatta ccaaaatttc ttcaggatct ttcttcaact 6060 

gatgctgatc tgccttggaa tagagcaaaa aaccgcttcc caaacataaa accatataat 6120 

aataacagag taaagctgat agctgacgct agtgttccag gttcggatta tattaatgcc 6180 

agctatattt ctggttattt atgtccaaat gaatttattg ctactcaagg tccactacca 6240 

ggaacagttg gagatttttg gaaaatggtg tgggaaacca gagcaaaaac attagtaatg 6300 

e.taacacagt gttttgaaaa aggacggatc agatgccatc agtattggcc agaggacaac 6360 

aagccagtta ctgtctttgg agatatagtg attacaaagc taatggagga tgttcaaata 6420 

gattggacta tcagggatct gaaaattgaa aggcatgggg attgcatgac tgttcgacag 6480 

tgtaacttta ctgcctggcc agagcatggg gttcctgaga acagcgcccc tctaattcac 6540 

tttgtgaagt tggttcgagc aagcagggca catgacacca cacctatgat tgttcactgc 6600 

agtgctggag ttggaagaac tggagttttt attgctctgg accatttaac acaacatata 6660 

aatgaccatg attttgtgga tatatatgga ctagtagctg aactgagaag tgaaagaatg 6720 

tgcatggtgc agaatctggc acagtatatc tttttacacc agtgcattct ggatctctta 6780 

tcaaataagg gaagtaatca gcccatctgt tttgttaact attcagcact tcagaagatg 6840 

gactctttgg acgccatgga aggtgatgtt gagcttgaat gggaagaaac cactatg 6897 

<210> 2 

<211> 2299 

<212> PRT 

<213> Homo sapiens 

<400> 2 

Met Asp Phe Leu lie lie Phe Leu Leu Leu Phe lie Gly Thr Ser Glu 
1 5 10 15 

Thr Gin Val Asp Val Ser Asn Val Val Pro Gly Thr Arg Tyr Asp lie 
20 25 30 



Thr lie Ser Ser lie Ser Thr Thr Tyr Thr Ser Pro Val Thr Arg He 
35 40 45 
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Val Thr Thr Asn Val Thr Lys Pro Gly Pro Pro Val Phe Leu Ala Gly 
50 55 60 



Glu Arg Val Gly Ser Ala Gly lie Leu Leu Ser Trp Asn Thr Pro Pro 
65 70 75 80 



Asn Pro Asn Gly Arg lie lie Ser Tyr lie Val Lys Tyr Lys Glu Val 
85 90 95 



Cys Pro Trp Met Gin Thr Val Tyr Thr Gin Val Arg Ser Lys Pro Asp 
100 105 110 



Ser Leu Glu Val Leu Leu Thr Asn Leu Asn Pro Gly Thr Thr Tyr Glu 
115 120 125 



lie Lys Val Ala Ala Glu Asn Ser Ala Gly lie Gly Val Phe Ser Asp 
130 135 140 



Pro Phe Leu Phe Gin Thr Ala Glu Ser Ala Pro Gly Lys Val Val Asn 
145 150 155 ^ 160 



Leu Thr Val Glu Ala Tyr Asn Ala Ser Ala Val Lys Leu He Trp Tyr 
165 170 175 



Leu Pro Arg Gin Pro Asn Gly Lys He Thr Ser Phe Lys He Ser Val 
180 185 190 



Lys His Ala Arg Ser Gly He Val Val Lys Asp Val Ser He Arg Val 
195 200 205 



Glu Asp He Leu Thr Gly Lys Leu Pro Glu Ser Asn Glu Asn Ser Glu 
210 215 220 



Ser Phe Leu Trp Ser Thr Ala Ser Pro Ser Pro Thr Leu Gly Arg Val 
225 230 235 ~ 240 



Thr Pro Pro Ser Arg Thr Thr His Ser Ser Ser Thr Leu Thr Gin Asn 
245 250 255 



Glu He Ser Ser Val Trp Lys Glu Pro He Ser Phe Val Val Thr His 
260 265 270 



Leu Arg Pro Tyr Thr Thr Tyr Leu Phe Glu Val Ser Ala Ala Thr Thr 
275 280 285 
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Glu Ala Gly Tyr lie Asp Ser Thr He Val Arg Thr Pro Glu Ser Val 
290 295 300 



Pro Glu Gly Pro Pro Gin Asn Cys Val Thr Gly Asn He Thr Gly Lys 
305 310 315 320 



Ser Phe Ser He Leu Trp Asp Pro Pro Thr He Val Thr Gly Lys Phe 
325 330 335 



Ser Tyr Arg Val Glu Leu Tyr Gly Pro Ser Gly Arg He Leu Asp Asn 
340 345 ~ 350 



Ser Thr Lys Asp Leu Lys Phe Ala Phe Thr Asn Leu Thr Pro Phe Thr 
355 360 365 



Met Tyr Asp Val Tyr He Ala Ala Glu Thr Ser Ala Gly Thr Gly Pro 
370 375 380 



Lys Ser Asn He Ser Val Phe Thr Pro Pro Asp Val Pro Gly Ala Val 
385 390 395 ~ 400 



Phe Asp Leu Gin Leu Ala Glu Val Glu Ser Thr Gin Val Arg He Thr 
405 410 415 



Trp Lys Lys Pro Arg Gin Pro Asn Gly He He Asn Gin Tyr Arg Val 
420 425 430 



Lys Val Leu Val Pro Glu Thr Gly He He Leu Glu Asn Thr Leu Leu 
435 440 445 



Thr Gly Asn Asn Glu Tyr He Asn Asp Pro Met Ala Pro Glu He Val 
450 455 460 



Asn He Val Glu Pro Met Val Gly Leu Tyr Glu Gly Ser Ala Glu Met 
465 470 475 480 



Ser Ser Asp Leu His Ser Leu Ala Thr Phe He Tyr Asn Ser His Pro 
485 490 495 



Asp Lys Asn Phe Pro Ala Arg Asn Arg Ala Glu Asp Gin Thr Ser Pro 
500 505 " 510 
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Val Val Thr Thr Arg Asn Gin Tyr He Thr Asp He Ala Ala Glu Gin 
515 520 525 



Leu Ser Tyr Val He Arg Arg Leu Val Pro Phe Thr Glu His Met He 
530 535 540 



Ser Val Ser Ala Phe Thr He Met Gly Glu Gly Pro Pro Thr Val Leu 
545 550 555 560 



Ser Val Arg Thr Arg Gin Gin Val Pro Ser Ser He Lys He He Asn 
565 570 ~ 575 



Tyr Lys Asn He Ser Ser Ser Ser He Leu Leu Tyr Trp Asp Pro Pro 
580 585 590 



Glu Tyr Pro Asn Gly Lys He Thr His Tyr Thr He Tyr Ala Met Glu 
595 600 605 



Leu Asp Thr Asn Arg Ala Phe Gin He Thr Thr He Asp Asn Ser Phe 
610 615 620 



Leu He Thr Gly Leu Lys Lys Tyr Thr Lys Tyr Lys Met Arg Val Ala 
625 630 635 640 



Ala Ser Thr His Val Gly Glu Ser Ser Leu Ser Glu Glu Asn Asp He 
645 650 655 



Phe Val Arg Thr Ser Glu Asp Glu Pro Glu Ser Ser Pro Gin Asp Val 
660 665 670 



Glu Val He Asp Val Thr Ala Asp Glu He Arg Leu Lys Trp Ser Pro 
675 680 685 



Pro Glu Lys Pro Asn Gly He He He Ala Tyr Glu Val Leu Tyr Lys 
690 695 700 



Asn He Asp Thr Leu Tyr Met Lys Asn Thr Ser Thr Thr Asp He He 
705 710 715 ~ 720 



Leu Arg Asn Leu Arg Pro His Thr Leu Tyr Asn He Ser Val Arg Ser 
72 5 730 735 



Tyr Thr Arg Phe Gly His Gly Asn Gin Val Ser Ser Leu Leu Ser Val 
740 745 750 
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Arg Thr Ser Glu Thr Val Pro Asp Ser Ala Pro Glu Asn lie Thr Tyr 
755 760 765 



Lys Asn He Ser Ser Gly Glu He Glu. Leu Ser Phe Leu Pro Pro Ser 
770 775 780 



Ser Pro Asn Gly He He Gin Lys Tyr Thr He Tyr Leu Lys Arg Ser 
785 790 795 ~ 800 



Asn Gly Asn Glu Glu Arg Thr He Asn Thr Thr Ser Leu Thr Gin Asn 
805 810 815 



He Lys Val Leu Lys Lys Tyr Thr Gin Tyr He He Glu Val Ser Ala 
820 825 830 



Ser Thr Leu Lys Gly Glu Gly Val Arg Ser Ala Pro He Ser He Leu 
835 840 845 



Thr Glu Glu Asp Ala Pro Asp Ser Pro Pro Gin Asp Phe Ser Val Lys 
850 855 860 



Gin Leu Ser Gly Val Thr Val Lys Leu Ser Trp Gin Pro Pro Leu Glu 
865 870 875 880 



Pro Asn Gly He He Leu Tyr Tyr Thr Val Tyr Val Trp Asn Arg Ser 
885 890 " 895 



Ser Leu Lys Thr He Asn Val Thr Glu Thr Ser Leu Glu Leu Ser Asp 
900 905 910 



Leu Asp Tyr Asn Val Glu Tyr Ser Ala Tyr Val Thr Ala Ser Thr Arg 
915 920 925 



Phe Gly Asp Gly Lys Thr Arg Ser Asn He He Ser Phe Gin Thr Pro 
930 935 940 



Glu Gly Ala Pro Ser Asp Pro Pro Lys Asp Val Tyr Tyr Ala Asn Leu 
945 950 955 960 



Ser Ser Ser Ser He He Leu Phe Trp Thr Pro Pro Ser Lys Pro Asn 
965 970 ~ 975 



Gly He He Gin Tyr Tyr Ser Val Tyr Tyr Arg Asn Thr Ser Gly Thr 
980 985 990 
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Phe Met Gin Asn Phe Thr Leu His Glu Val Thr Asn Asp Phe Asp Asn 
995 1000 1005 



Met Thr Val Ser Thr lie lie Asp Lys Leu Thr He Phe Ser Tyr 
1010 1015 1020 



Tyr Thr Phe Trp Leu Thr Ala Ser Thr Ser Val Gly Asn Gly Asn 
1025 1030 1035 



Lys Ser Ser Asp He He Glu Val Tyr Thr Asp Gin Asp He Pro 
1040 " 1045 1050 



Glu Gly Phe Val Gly Asn Leu Thr Tyr Glu Ser He Ser Ser Thr 
1055 1060 1065 



Ala He Asn Val Ser Trp Val Pro Pro Ala Gin Pro Asn Gly Leu 
1070 1075 1080 



Val Phe Tyr Tyr Val Ser Leu He Leu Gin Gin Thr Pro Arg His 
1085 " ~ 1090 1095 



Val Arg Pro Pro Leu Val Thr Tyr Glu Arg Ser He Tyr Phe Asp 
1100 1105 1110 



Asn Leu Glu Lys Tyr Thr Asp Tyr He Leu Lys He Thr Pro Ser 
1115 1120 1125 



Thr Glu Lys Gly Phe Ser Asp Thr Tyr Thr Ala Gin Leu Tyr He 
1130 1135 1140 



Lys Thr Glu Glu Asp Val Pro Glu Thr Ser Pro He lis Asn Thx 
1145 1150 1155 



Phe Lys Asn Leu Ser Ser Thr Ser Val Leu Leu Ser Trp Asp Pro 
1160 1165 1170 



Pro Val Lys Pro Asn Gly Ala He He Ser Tyr Asp Leu Thr Leu 
1175 1180 1185 



Gin Gly Pro Asn Glu Asn Tyr Ser Phe He Thr Ser Asp Asn Tyr 
1190 1195 1200 
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Ile lie Leu Glu Glu Leu Ser Pro Phe Thr Leu Tyr Ser Phe Phe 
1205 1210 1215 



Ala Ala Ala Arg Thr Arg Lys 61y Leu Gly Pro Ser Ser He Leu 
1220 1225 1230 



Phe Phe Tyr Thr Asp Glu Ser Val Pro Leu Ala Pro Pro Gin Asn 
1235 1240 1245 



Leu Thr Leu lie Asn Cys Thr Ser Asp Phe Val Trp Leu Lys Trp 
1250 1255 1260 



Ser Pro Ser Pro Leu Pro Gly Gly lie Val Lys Val Tyr Ser Phe 
1265 1270 1275 



Lys He His Glu His Glu Thr Asp Thr He Tyr Tyr Lys Asn He 
1280 1285 1290 



Ser Gly Phe Lys Thr Glu Ala Lys Leu Val Gly Leu Glu Pro Val 
1295 1300 ~ 1305 



Ser Thr Tyr Ser He Arg Val Ser Ala Phe Thr Lys Val Gly Asn 
1310 1315 1320 



Gly Asn Gin Phe Ser Asn Val Val Lys Phe Thr Thr Gin Glu Ser 
1325 1330 1335 



Val Pro Asp Val Val Gin Asn Met Gin Cys Met Ala Thr Ser Trp 
1340 1345 1350 



Gin Ser Val Leu Val Lys Trp Asp Pro Pro Lys Lys Ala Asn Gly 
1355 1360 1365 



He He Thr Gin Tyr Met Val Thr Val Glu Arg Asn Ser Thr Lys 
1370 1375 1380 



Val Ser Pro Gin Asp His Met Tyr Thr Phe He Lys Leu Leu Ala 
1385 1390 1395 



Asn Thr Ser Tyr Val Phe Lys Val Arg Ala Ser Thr Ser Ala Gly 
1400 ' 1405 1410 



Glu Gly Asp Glu Ser Thr Cys His Val Ser Thr Leu Pro Glu Thr 
1415 1420 1425 
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Val Pro Ser Val Pro Thr Asn He Ala Phe Ser Asp Val Gin Ser 
1430 1435 1440 



Thr Ser Ala Thr Leu Thr Trp He Arg Pro Asp Thr He Leu Gly 
1445 1450 1455 



Tyr Phe Gin Asn Tyr Lys He Thr Thr Gin Leu Arg Ala Gin Lys 
1460 1465 1470 



Cys Lys Glu Trp Glu Ser Glu Glu Cys Val Glu Tyr Gin Lys He 
1475 1480 1485 



Gin Tyr Leu Tyr Glu Ala His Leu Thr Glu Glu Thr Val Tyr Gly 
1490 1495 1500 



Leu Lys Lys Phe Arg Trp Tyr Arg Phe Gin Val Ala Ala Ser Thr 
1505 " 1510 1515 



Asn Ala Gly Tyr Gly Asn Ala Ser Asn Trp He Ser Thr Lys Thr 
1520 1525 1530 



Leu Pro Gly Pro Pro Asp Gly Pro Pro Glu Asn Val His Val Val 
1535 1540 1545 



Ala Thr Ser Pro Phe Ser He Ser He Ser Trp Ser Glu Pro Ala 
1550 1555 1560 



Val He Thr Gly Pro Thr Cys Tyr Leu He Asp Val Lys Ser Val 
1565 1570 1575 



Asp Asn Asp Glu Phe Asn lie Ser Phe He Lys Ser Asn Glu Glu 
1580 1585 1590 



ksn Lys Thr He Glu He Lys Asp Leu Glu lie Phe Thr Arg Tyr 
1595 1600 1605 



er Val Val He Thr Ala Phe Thr Gly Asn He Ser Ala Ala Tyr 
1610 1615 1620 



a Glu Gly Lys Ser Ser Ala Glu Met He Val Thr Thr Leu Glu 
1625 1630 1635 
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Ser Ala Pro Lys Asp Pro Pro Asn Asn Met Thr Phe Gin Lys lie 
1640 1645 1650 



Pro Asp Glu Val Thr Lys Phe Gin Leu Thr Phe Leu Pro Pro Ser 
1655 1660 1665 



Gin Pro Asn Gly Asn lie Gin Val Tyr Gin Ala Leu Val Tyr Arg 
1670 1675 1680 



Glu Asp Asp Pro Thr Ala Val Gin lie His Asn Leu Ser lie lie 
1685 1690 1695 



Gin Lys Thr Asn Thr Phe Val He Ala Met Leu Glu Gly Leu Lys 
1700 1705 1710 



Gly Gly His Thr Tyr Asn He Ser Val Tyr Ala Val Asn Ser Ala 
1715 1720 1725 



Gly Ala Gly Pro Lys Val Pro Met Arg He Thr Met Asp He Lys 
1730 1735 1740 



Ala Pro Ala Arg Pro Lys Thr Lys Pro Thr Pro He Tyr Asp Ala 
1745 1750 1755 



Thr Gly Lys Leu Leu Val Thr Ser Thr Thr He Thr He Arg Met 
1760 1765 1770 



Pro He Cys Tyr Tyr Ser Asp Asp His Gly Pro He Lys Asn Val 
1775 1780 1785 



Gin Val Leu Val Thr Glu Thr Gly Ala Gin His Asp Gly Asn Val 
1790 1795 1800 



Thr Lys Trp Tyr Asp Ala Tyr Phe Asn Lys Ala Arg Pro Tyr Phe 
1805 1810 1815 



Thr Asn Glu Gly Phe Pro Asn Pro Pro Cys Thr Glu Gly Lys Thr 
1820 1825 ~ 1830 



Lys Phe Ser Gly Asn Glu Glu He Tyr He He Gly Ala Asp Asn 
1835 1840 1845 



Ala Cys Met He Pro Gly Asn Glu Asp Lys He Cys Asn Gly Pro 
1850 1855 1860 
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Leu Lys Pro Lys Lys Gin Tyr Leu Phe Lys Phe Arg Ala Thr Asn 
1865 1870 ^ 1875 



lie Met Gly Gin Phe Thr Asp Ser Asp Tyr Ser Asp Pro Val Lys 
1880 1885 1890 



Thr Leu Gly Glu Gly Leu Ser Glu Arg Thr Val Glu lie lie Leu 
1895 1900 1905 



Ser Val Thr Leu Cys He Leu Ser He He Leu Leu Gly Thr Ala 
1910 1915 1920 



He Phe Ala Phe Ala Arg He Arg Gin Lys Gin Lys Glu Gly Gly 
1925 1930 1935 



Thr Tyr Ser Pro Gin Asp Ala Glu He He Asp Thr Lys Leu Lys 
1940 1945 1950 



Leu Asp Gin Leu He Thr Val Ala Asp Leu Glu Leu Lys Asp Glu 
1955 1960 1965 



Arg Leu Thr Arg Leu Leu Ser Tyr Arg Lys Ser He Lys Pro He 
1970 1975 ~ 1980 



Ser Lys Lys Ser Phe Leu Gin His Val Glu Glu Leu Cys Thr Asn 
1985 1990 1995 



Asn Asn Leu Lys Phe Gin Glu Glu Phe Ser Glu Leu Pro Lys Phe 
2000 2005 2010 



Leu Gin Asp Leu Ser Ser Thr Asp Ala Asp Leu Pro Trp Asn Arg 
2015 2020 2025 



Ala Lys Asn Arg Phe Pro Asn He Lys Pro Tyr Asn Asn Asn Arg 
2030 2035 2040 



Val Lys Leu He Ala Asp Ala Ser Val Pro Gly Ser Asp Tyr He 
2045 ' 2050 2055 



Asn Ala Ser Tyr He Ser Gly Tyr Leu Cys Pro Asn Glu Phe He 
2060 2065 2070 



Ala Thr Gin Gly Pro Leu Pro Gly Thr Val Gly Asp Phe Trp Lys 
2075 2080 2085 
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Met Val Trp Glu Thr Arg Ala Lys Thr Leu Val Met Leu Thr Gin 
2090 2095 2100 



Cys Phe Glu Lys Gly Arg lie Arg Cys His Gin Tyr Trp Pro Glu 
2105 2110 2115 



Asp Asn Lys Pro Val Thr Val Phe Gly Asp lie Val lie Thr Lys 
2120 2125 2130 



Leu Met Glu Asp Val Gin lie Asp Trp Thr lie Arg Asp Leu Lys 
2135 2140 2145 



lie Glu Arg His Gly Asp Cys Met Thr Val Arg Gin Cys Asn Phe 
2150 2155 2160 



Thr Ala Trp Pro Glu His Gly Val Pro Glu Asn Ser Ala Pro Leu 
2165 2170 2175 



lie His Phe Val Lys Leu Val Arg Ala Ser Arg Ala His Asp Thr 
2180 2185 2190 



Thr Pro Met He Val His Cys Ser Ala Gly Val Gly Arg Thr Gly 
2195 2200 2205 



Val Phe He Ala Leu Asp His Leu Thr Gin His He Asn Asp His 
2210 2215 2220 



Asp Phe Val Asp He Tyr Gly Leu Val Ala Glu Leu Arg Ser Glu 
2225 2230 2235 



.Arg Met Cys Met Val Gin Asn Leu Ala Gin Tyr He Phe Leu His 
2240 2245 2250 



Gin Cys He Leu Asp Leu Leu Ser Asn Lys Gly Ser Asn Gin Pro 
2255 2260 2265 



He Cys Phe Val Asn Tyr Ser Ala Leu Gin Lys Met Asp Ser Leu 
2270 2275 2280 



Asp Ala Met Glu Gly Asp Val Glu Leu Glu Trp Glu Glu Thr Thr 
2285 2290 2 295 
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Met 

<210> 3 

<211> 2302 

<212> PRT 

<213> Homo sapiens 



<400> 3 

Met Met Asp Phe His Phe Ser Phe Leu Phe Leu Leu lie Gly Thr Ser 
15 10 15 



Glu Ser Gin Val Asp Val Ser Ser Ser Phe Asp Gly Thr Gly Tyr Asp 
20 25 30 



lie Thr Leu Ser Ser Val Ser Ala Thr Thr Tyr Ser Ser Pro Val Ser 
35 40 45 



Arg Thr Leu Ala Thr Asn Val Thr Lys Pro Gly Pro Pro Val Phe Leu 
50 55 60 



Ala Gly Glu Arg Val Gly Ser Ala Gly lie Leu Leu Ser Trp Asn Thr 
65 70 75 80 



Pro Pro Asn Pro Asn Gly Arg lie lie Ser Tyr Val Val Lys Tyr Lys 
85 ^ 90 ~ ~ 95 



Glu Val Cys Pro Trp Met Gin Thr Ala Tyr Thr Arg Ala Arg Ala Lys 
100 105 110 



Pro Asp Ser Leu Glu Val Leu Leu Thr Asn Leu Asn Pro Gly Thr Thr 
115 120 125 



Tyr Glu lie Lys Val Ala Ala Glu Asn Asn Ala Gly He Gly Val Phe 
130 135 140 



Ser Asp Pro Phe Leu Phe Gin Thr Ala Glu Ser Ala Pro Gly Lys Val 
145 150 155 160 



Val Asn Leu Thr Val Glu Ala Leu Asn Tyr Ser Ala Val Asn Leu He 
165 170 175 
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Trp Tyr Leu Pro Arg Gin Pro Asn Gly Lys lie Thr Ser Phe Lys lie 
180 185 190 



Ser Val Lys His Ala Arg Ser Gly lie Val Val Lys Asp Val Ser Leu 
195 " 200 " 205 



Arg Val Glu Asp lie Leu Ser Gly Lys Leu Pro Glu Cys Asn Glu Asn 
210 215 220 



Ser Glu Ser Phe Leu Trp Ser Thr Thr Ser Pro Ser Pro Thr Leu Gly 
225 230 235 240 



Arg Val Thr Pro Thr Val Arg Thr Thr Gin Ser Ser Ser Thr Ala Ala 
245 250 255 



Arg Ser Lys lie Ser Ser Val Trp Lys Glu Pro lie Ser Phe Val Val 
260 265 270 



Thr His Leu Arg Pro Tyr Thr Thr Tyr Leu Phe Glu Val Ser Ala Val 
275 280 285 



Thr Thr Glu Ala Gly Tyr lie Asp Ser Thr lie Val Arg Thr Pro Glu 
290 295 300 



Ser Val Pro Glu Gly Pro Pro Gin Asn Cys lie Met Gly Asn Val Thr 
305 310 315 " 320 



Gly Lys Ala Phe Ser lie Ser Trp Asp Pro Pro Thr lie Val Thr Gly 
325 * 330 335 



Lys Phe Ser Tyr Arg Val Glu Leu Tyr Gly Pro Ser Gly Arg lie Leu 
340 345 350 



Asp Asn Ser Thr Lys Asp Leu Arg Phe Ala Phe Thr His Leu Thr Pro 
355 360 365 



Phe Thr Met Tyr Asp Val Tyr Val Ala Ala Glu Thr Ser Ala Gly Val 
370 375 380 



Gly Pro Lys Ser Asn Leu Ser Val Phe Thr Pro Pro Asp Val Pro Gly 
385 390 395 400 
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Ala Val Phe Asp Leu Gin lie Ala Glu Val Glu Ala Thr Glu lie Arg 
405 410 415 



He Thr Trp Arg Lys Pro Arg Gin Pro Asn Gly lie He Ser Gin Tyr 
420 425 430 



Arg Val Lys Val Ser Val Leu Glu Thr Gly Val Val Leu Glu Asn Thr 
435 440 445 



Leu Leu Thr Gly Gin Asp Glu Ser He Ser Asn Pro Met Ser Pro Glu 
450 455 460 



He Met Asn Leu Val Asp Pro Met He Gly Phe Tyr Glu Gly Ser Gly 
465 470 475 ^ 480 



Glu Met Ser Ser Asp Leu His Ser Pro Ala Ser Phe He Tyr Asn Ser 
485 490 495 



His Pro His Asn Asp Phe Pro Ala Ser Thr Arg Ala Glu Glu Gin Ser 
500 505 510 



Ser Pro Val Val Thr Thr Arg Asn Gin Tyr Met Thr Asp He Thr Ala 
515 520 525 



Glu Gin Leu Ser Tyr Val Val Arg Arg Leu Val Pro Phe Thr Glu His 
530 535 540 



Thr He Ser Val Ser Ala Phe Thr He Met Gly Glu Gly Pro Pro Thr 
545 550 555 560 



Val Leu Thr Val Arg Thr Arg Glu Gin Val Pro Ser Ser He Gin lie 
565 570 575 



He Asn Tyr Lys Asn He Ser Ser Ser Ser He Leu Leu Tyr Trp Asp 
580 585 590 



Pro Pro Glu Tyr Pro Asn Gly Lys He Thr His Tyr Thr He Tyr Ala 
595 600 605 



Thr Glu Leu Asp Thr Asn Arg Ala Phe Gin Met Thr Thr Val Asp Asn 
610 615 620 



Ser Phe Leu He Thr Gly Leu Lys Lys Tyr Thr Arg Tyr Lys Met Arg 
625 630 635 640 
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Val Ala Ala Ser Thr His Val Gly Glu Ser Ser Leu Ser Glu Glu Asn 
645 650 655 



Asp lie Phe Val Arg Thr Pro Glu Asp Glu Pro Glu Ser Ser Pro Gin 
660 665 670 



Asp Val Gin Val Thr Gly Val Ser Pro Ser Glu Leu Arg Leu Lys Trp 
675 680 685 



Ser Pro Pro Glu Lys Pro Asn Gly lie lie lie Ala Tyf Glu Val Leu 
690 695 700 



Tyr Gin Asn Ala Asp Thr Leu Phe Val Lys Asn Thr Ser Thr Thr Asp 
705 710 715 720 



He He He Ser Asp Leu Lys Pro Tyr Thr Leu Tyr Asn He Ser He 
725 730 735 



Arg Ser Tyr Thr Arg Leu Gly His Gly Asn Gin Ser Ser Ser Leu Leu 
740 745 750 



Ser Val Arg Thr Ser Glu Thr Val Pro Asp Ser Ala Pro Glu Asn He 
755 760 765 



Thr Tyr Lys Asn He Ser Ser Gly Glu He Glu He Ser Phe Leu Pro 
770 775 780 



Pro Arg Ser Pro Asn Gly He He Gin Lys Tyr Thr He Tyr Leu Lys 
785 790 795 800 



Arg Ser Asn Ser His Glu Ala Arg Thr He Asn Thr Thr Ser Leu Thr 
805 810 815 



Gin Thr He Gly Gly Leu Lys Lys Tyr Thr His Tyr Val He Glu Val 
820 825 830 



Ser Ala Ser Thr Leu Lys Gly Glu Gly He Arg Ser Arg Pro He Ser 
835 840 845 



He Leu Thr Glu Glu Asp Ala Pro Asp Ser Pro Pro Gin Asn Phe Ser 
850 855 860 



Val Lys Gin Leu Ser Gly Val Thr Val Met Leu Ser Trp Gin Pro Pro 
865 870 875 880 
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Leu Glu Pro Asn Gly He He Leu Tyr Tyr Thr Val Tyr Val Trp Asp 
885 890 ^ 895 



Lys Ser Ser Leu Arg Ala He Asn Ala Thr Glu Ala Ser Leu Val Leu 
900 905 910 



Ser Asp Leu Asp Tyr Asn Val Asp Tyr Gly Ala Cys Val Thr Ala Ser 
915 920 925 



Thr Arg Phe Gly Asp Gly Asn Ala Arg Ser Ser He He Asn Phe Arg 
930 935 940 



Thr Pro Glu Gly Glu Pro Ser Asp Pro Pro Asn Asp Val His Tyr Val 
945 950 955 ~ 960 



Asn Leu Ser Ser Ser Ser He He Leu Phe Trp Thr Pro Pro Val Lys 
965 970 975 



Pro Asn Gly He He Gin Tyr Tyr Ser Val Tyr Tyr Gin Asn Thr Ser 
980 985 ~ 990 

Gly Thr Phe Val Gin Asn Phe Thr Leu Leu Gin Val Thr Lys Glu Ser 
995 1000 1005 



Asp Asn Val Thr Val Ser Ala Arg He Tyr Arg Leu Ala lie Phe 
1010 1015 1020 



Ser Tyr Tyr Thr Phe Trp Leu Thr Ala Ser Thr Ser Val Gly Asn 
1025 1030 1035 



Gly Asn Lys Ser Ser Asp He He His Val Tyr Thr Asp Gin Asp 
1040 1045 1050 



He Pro Glu Gly Pro Val Gly Asn Leu Thr Phe Glu Ser He Ser 
1055 1060 1065 



Ser Thr Ala He His Val Ser Trp Glu Pro Pro Ser Gin Pro Asn 
1070 1075 1080 



Gly Leu Val Phe Tyr Tyr Leu Ser Leu Asn Leu Gin Gin Ser Pro 
1085 1090 1095 
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Pro Arg His Met He Pro Pro Leu Val Thr Tyr Glu Asn Ser He 
1100 1105 1110 



Asp Phe Asp Asp Leu Glu Lys Tyr Thr Asp Tyr He Phe Lys He 
1115 1120 1125 



Thr Pro Ser Thr Glu Lys Gly Phe Ser Glu Thr Tyr Thr Thr Gin 
1130 1135 1140 



Leu His He Lys Thr Glu Glu Asp Val Pro Asp Thr Pro Pro He 
1145 1150 1155 



He Asn Thr Phe Lys Asn Leu Ser Ser Thr Ser He Leu Leu Ser 
1160 1165 1170 



Trp Asp Pro Pro Leu Lys Pro Asn Gly Ala He Leu Gly Tyr His 
1175 1180 1185 



Leu Thr Leu Gin Gly Pro His Ala Asn His Thr Phe Val Thr Ser 
1190 1195 1200 



Gly Asn His He Val Leu Glu Glu Leu Ser Pro Phe Thr Leu Tyr 
1205 1210 1215 



Ser Phe Phe Ala Ala Ala Arg Thr Met Lys Gly Leu Gly Pro Ser 
1220 1225 1230 



Ser He Leu Phe Phe Tyr Thr Asp Glu Ser Ala Pro Leu Ala Pro 
1235 1240 1245 



Pro Gin Asn Leu Thr Leu He Asn Tyr Thr Ser Asp Phe Val Trp 
1250 1255 1260 



Leu Thr Trp Ser Pro Ser Pro Leu Pro Gly Gly He Val Lys Val 
1265 1270 1275 



Tyr Ser Phe Lys He His Glu His Glu Thr Asp Thr Val Phe Tyr 
1280 1285 1290' 



Lys Asn He Ser Gly Leu Gin Thr Asp Ala Lys Leu Glu Gly Leu 
1295 1300 1305 



Glu Pro Val Ser Thr Tyr Ser Val Ser Val Ser Ala Phe Thr Lys 
1310 1315 1320 
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Val Gly Asn Gly Asn Gin Tyr Ser Asn Val Val Glu Phe Thr Thr 
1325 1330 1335 



Gin Glu Ser Val Pro Glu Ala Val Arg Asn He Glu Cys Val Ala 
1340 1345 1350 



Arg Asp Trp Gin Ser Val Ser Val Arg Trp Asp Pro Pro Arg Lys 
1355 1360 " ~ 1365 



Thr Asn Gly He He He His Tyr Met He Thr Val Gly Gly Asn 
1370 1375 1380 



Ser Thr Lys Val Ser Pro Arg Asp Pro Thr Tyr Thr Phe Thr Lys 
1385 1390 ~ 1395 



Leu Leu Pro Asn Thr Ser Tyr Val Phe Glu Val Arg Ala Ser Thr 
1400 1405 1410 



Ser Ala Gly Glu Gly Asn Glu Ser Arg Cys Asp He Ser Thr Leu 
1415 1420 1425 



Pro Glu Thr Val Pro Ser Ala Pro Thr Asn Val Ala Phe Ser Asn 
1430 1435 1440 



Val Gin Ser Thr Ser Ala Thr Leu Thr Trp Thr Lys Pro Asp Thr 
1445 1450 1455 



He Phe Gly Tyr Phe Gin Asn Tyr Lys He Thr Thr Gin Leu Arg 
1460 1465 ~ 1470 



Ala Gin Lys Cys Arg Glu Trp Glu Pro Glu Glu Cys He Glu His 
1475 1480 1485 



Gin Lys Asp Gin Tyr Leu Tyr Glu Ala Asn Gin Thr Glu Glu Thr 
1490 1495 1500 



Val His Gly Leu Lys Lys Phe Arg Trp Tyr Arg Phe Gin Val Ala 
1505 1510 " 1515 



Ala Ser Thr Asn Val Gly Tyr Ser Asn Ala Ser Glu Trp He Ser 
1520 1525 1530 
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Thr Gin Thr Leu Pro Gly Pro Pro Asp Gly Pro Pro Glu Asn Val 
1535 1540 1545 



His Val Val Ala Thr Ser Pro Phe Gly lie Asn lie Ser Trp Ser 
1550 1555 ~ 1560 



Glu Pro Ala Val lie Thr Gly Pro Thr Phe Tyr Leu lie Asp Val 
1565 1570 1575 



Lys Ser Val Asp Asp Asp Asp Phe Asn He Ser Phe Leu Lys Ser 
1580 1585 1590 



Asn Glu Glu Asn Lys Thr Thr Glu He Asn Asn Leu Glu Val Phe 
1595 1600 1605 



Thr Arg Tyr Ser Val Val He Thr Ala Phe Val Gly Asn Val Ser 
1610 1615 1620 



Arg Ala Tyr Thr Asp Gly Lys Ser Ser Ala Glu Val He He Thr 
1625 1630 1635 



Thr Leu Glu Ser Val Pro Lys Asp Pro Pro Asn Asn Met Thr Phe 
1640 1645 1650 



Gin Lys He Pro Asp Glu Val Thr Lys Phe Gin Leu Thr Phe Leu 
1655 1660 1665 



Pro Pro Ser Gin Pro Asn Gly Asn He Arg Val Tyr Gin Ala Leu 
1670 1675 " 1680 



Val Tyr Arg Glu Asp Asp Pro Thr Ala Val Gin He His Asn Phe 
1685 1690 1695 



Ser He He Gin Lys Thr Asp Thr Ser He He Ala Met Leu Glu 
1700 1705 1710 



Gly Leu Lys Gly Gly His Thr Tyr Asn He Ser Val Tyr Ala He 
1715 1720 1725 



Asn Ser Ala Gly Ala Gly Pro Lys Val Gin Met Arg He Thr Met 
1730 1735 1740 



Asp He Lys Ala Pro Ala Arg Pro Lys Ser Lys Pro He Pro He 
1745' 1750 1755 
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Arg Asp Ala Thr Gly Lys Leu Leu Val Thx Ser Thr Thr lie Thr 
1760 1765 1770 



lie Arg Met Pro lie Cys Tyr Tyr Asn Asp Asp His Gly Pro lie 
1775 1780 ~ " 1785 



Arg Asn Val Gin Val Leu Val Ala Glu Thr Gly Ala Gin Gin Asp 
1790 1795 1800 



Gly Asn Val Thr Lys Trp Tyr Asp Ala Tyr Phe Asn Lys Ala Arg 
1805 1810 1815 



Pro Tyr Phe Thr Asn Glu Gly Phe Pro Asn Pro Pro Cys lie Glu 
1820 1825 1830 



Gly Lys Thr Lys Phe Ser Gly Asn Glu Glu lie Tyr Val lie Gly 
1835 1840 1845 



Ala Asp Asn Ala Cys Met lie Pro Gly Asn Glu Glu Lys He Cys 
1850 1855 1860 



Asn Gly Pro Leu Lys Pro Lys Lys Gin Tyr Leu Phe Lys Phe Arg 
1865 1870 1875 



Ala Thr Asn Val Met Gly Gin Phe Thr Asp Ser Glu Tyr Ser Asp 
1880 1885 * 1890 



Pro He Lys Thr Leu Gly Glu Gly Leu Ser Glu Arg Thr Val Glu 
1895 1900 1905 



He He Leu Ser Val Thr Leu Cys He Leu Ser He He Leu Leu 
1910 1915 1920 



Gly Thr Ala He Phe Ala Phe Val Arg He Arg Gin Lys Gin Lys 
1925 1930 1935 



Glu Gly Gly Thr Tyr Ser Pro Arg Asp Ala Glu He He Asp Thr 
1940 1945 1950 



Lys Phe Lys Leu Asp Gin Leu He Thr Val Ala Asp Leu Glu Leu 
1955 I960 1965 
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Lys Asp Glu Arg Leu Thr Arg Leu Leu Ser Tyr Arg Lys Ser lie 
1970 1975 1980 



Lys Pro lie Ser Lys Lys Ser Phe Leu Gin His Val Glu Glu Leu 
1985 1990 1995 



Cys Thr Asn Ser Asn Leu Lys Phe Gin Glu Glu Phe Ser Glu Leu 
2000 2005 2010 



Pro Lys Phe Leu Gin Asp Leu Ser Ser Thr Asp Ala Asp Leu Pro 
2015 2020 2025 



Trp Asn Arg Ala Lys Asn Arg Phe Pro Asn He Lys Pro Tyr Asn 
2030 2035 2040 



Asn Asn Arg Val Lys Leu He Ala Asp Val Ser Leu Pro Gly Ser 
2045 2050 2055 



Asp Tyr He Asn Ala Ser Tyr Val Ser Gly Tyr Leu Cys Pro Asn 
2060 2065 2070 



Glu Phe He Ala Thr Gin Gly Pro Leu Pro Gly Thr Val Gly Asp 
2075 2080 2085 



Phe Trp Arg Met Val Trp Glu Thr Arg Thr Lys Thr Leu Val Met 
2090 2095 2100 



Leu Thr Gin Cys Phe Glu Lys Gly Arg He Arg Cys His Gin Tyr 
2105 2110 2115 



Trp Pro Glu Asp Asn Lys Pro Val Thr Val Phe Gly Asp He Val 
2120 2125 2130 



He Thr Lys Leu Met Glu Asp He Gin He Asp Trp Thr He Arg 
2135 2140 ■ 2145 



Asp Leu Lys He Glu Arg His Gly Asp Cys Met Thr Val Arg Gin 
2150 2155 2160 



Cys Asn Phe Thr Gly Trp Pro Glu His Gly Val Pro Glu Asn Thr 
2165 2170 2175 



Thr Pro Leu He His Phe Val Lys Leu Val Arg Thr Ser Arg Ala 
2180 2185 2190 
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His Asp Thr Thr Pro Met Val Val His Cys Ser Ala Gly Val Gly 
2195 2200 2205 



Arg Thr Gly Val Phe lie Ala Leu Asp His Leu Thr Gin His He 
2210 2215 2220 



Asn Asn His Asp Phe Val Asp He Tyr Gly Leu Val Ala Glu Leu 
2225 2230 2235 



Arg Ser Glu Arg Met Cys Met Val Gin Asn Leu Ala Gin Tyr He 
2240 2245 2250 



Phe Leu His Gin Cys He Leu Asp Leu Leu Ser Asn Lys Gly Gly 
2255 2260 2265 



His Gin Pro Val Cys Phe Val Asn Tyr Ser Thr Leu Gin Lys Met 
2270 2275 2280 



Asp Ser Leu Asp Ala Met Glu Gly Asp Val Glu Leu Glu Trp Glu 
2285 2290 2295 



Glu Thr Thr Met 
2300 



<210> 4 

<211> 330 

<212> PRT 

<213> Homo sapiens 

<400> 4 

Met Asp Phe Leu He He Phe Leu Leu Leu Phe He Gly Thr Ser Glu 
15 10 15 



Thr Gin Val Asp Val Ser Asn Val Val Pro Gly Thr Arg Tyr Asp He 
20 25 30 



Thr He Ser Ser He Ser Thr Thr Tyr Thr Ser Pro Val Thr Arg He 
35 40 45 



Val Thr Thr Asn Val Thr Lys Pro Gly Pro Pro Val Phe Leu Ala Gly 
50 55 60 
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Glu Arg Val Gly Ser Ala Gly He Leu Leu Ser Trp Asn Thr Pro Pro 
65 70 75 80 



Asn Pro Asn Gly Arg He He Ser Tyr He Val Lys Tyr Lys Glu Val 
85 90 ~ 95 



Cys Pro Trp Met Gin Thr Val Tyr Thr Gin Val Arg Ser Lys Pro Asp 
100 105 HQ 



Ser Leu Glu Val Leu Leu Thr Asn Leu Asn Pro Gly Thr Thr Tyr Glu 
115 120 125 



He Lys Val Ala Ala Glu Asn Ser Ala Gly He Gly Val Phe Ser Asp 
130 135 140 



Pro Phe Leu Phe Gin Thr Ala Glu Ser Ala Pro Gly Lys Val Val Asn 
145 150 155 ~ ^ 160 



Leu Thr Val Glu Ala Tyr Asn Ala Ser Ala Val Lys Leu He Trp Tyr 
165 170 175 



Leu Pro Arg Gin Pro Asn Gly Lys He Thr Ser Phe Lys He Ser Val 
180 185 190 



Lys His Ala Arg Ser Gly He Val Val Lys Asp Val Ser He Arg Val 
195 200 ~ 205 



Glu Asp He Leu Thr Gly Lys Leu Pro Glu Ser Asn Glu Asn Ser Glu 
210 215 220 



Ser Phe Leu Trp Ser Thr Ala Ser Pro Ser Pro Thr Leu Gly Arg Val 
225 230 235 240 



Thr Pro Pro Ser Arg Thr Thr His Ser Ser Ser Thr Leu Thr Gin Asn 
245 250 255 



Glu He Ser Ser Val Trp Lys Glu Pro He Ser Phe Val Val Thr His 
260 265 270 



Leu Arg Pro Tyr Thr Thr Tyr Leu Phe Glu Val Ser Ala Ala Thr Thr 
275 280 285 



Glu Ala Gly Tyr He Asp Ser Thr He Val Arg Thr Pro Glu Ser Ala 
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290 295 300 



Thr Ala Ala Gly Thr Gin Ala Asn Arg Val Trp Ser Arg Pro Leu Ala 
305 310 315 320 



Asn Ser Asn Arg Pro Ala Ala Glu Gly Pro 
325 330 



<210> 5 

<211> 1959 

<212> PRT 

<213> Homo sapiens 

<400> 5 

Leu Leu Gly Gly Lys Leu Thr Asn Arg Lys Asp lie His Thr Lys Asn 
15 10 15 



Pro Ser Val His His His His Gin Arg Pro Lys Val Asp Lys Thr Thr 
20 25 30 



Lys Met Gly Lys Lys Gin Ser Arg Lys Thr Gly Asn Ser Lys Lys Gin 
35 40 45 



Ser Thr Ser Pro Pro Pro Lys Asp Gly Arg Ala Val Phe Asp Leu Gin 
50 55 60 



Leu Ala Glu Val Glu Ser Thr Gin Val Arg Xle Thr Trp Lys Lys Pro 
65 70 75 80 



Arg Gin Pro Asn Gly He He Asn Gin Tyr Arg Val Lys Val Leu Val 
85 90 95 



Pro Glu Thr Gly He He Leu Glu Asn Thr Leu Leu Thr Gly Asn Asn 
100 105 110 



Glu Tyr He Asn Asp Pro Met Ala Pro Glu He Val Asn He Val Glu 
115 120 125 



Pro Met Val Gly Leu Tyr Glu Gly Ser Ala Glu Met Ser Ser Asp Leu 
130 135 140 



WO 03/033688 



PCT/EP02/11473 



•29- 



His Ser Leu Ala Thr Phe lie Tyr Asn Ser His Pro Asp Lys Asn Phe 
145 150 155 160 



Pro Ala Arg Asn Arg Ala Glu Asp Gin Thr Ser Pro Val Val Thr Thr 
165 170 175 



Arg Asn Gin Tyr He Thr Asp He Ala Ala Glu Gin Leu Ser Tyr Val 
180 185 190 



He Arg Arg Leu Val Pro Phe Thr Glu His Met He Ser Val Ser Ala 
195 200 205 



Phe Thr He Met Gly Glu Gly Pro Pro Thr Val Leu Ser Val Arg Thr 
210 215 220 



Arg Gin Gin Val Pro Ser Ser He Lys He He Asn Tyr Lys Asn He 
225 230 235 " 240 

Ser Ser Ser Ser He Leu Leu Tyr Trp Asp Pro Pro Glu Tyr Pro Asn 
245 250 255 

Gly Lys He Thr His Tyr Thr He Tyr Ala Met Glu Leu Asp Thr Asn 
260 265 270 

Arg Ala Phe Gin He Thr Thr He Asp Asn Ser Phe Leu He Thr Gly 
275 280 285 



Leu Lys Lys Tyr Thr Lys Tyr Lys Met Arg Val Ala Ala Ser Thr His 
290 295 300 



Val Gly Glu Ser Ser Leu Ser Glu Glu Asn Asp He Phe Val Arg Thr 
305 310 315 320 

Ser Glu Asp Glu Pro Glu Ser Ser Pro Gin Asp Val Glu Val He Asp 
325 330 335 

Val Thr Ala Asp Glu He Arg Leu Lys Trp Ser Pro Pro Glu Lys Pro 
340 345 350 



Asn Gly He He He Ala Tyr Glu Val Leu Tyr Lys Asn He Asp Thr 
355 360 365 



Leu Tyr Met Lys Asn Thr Ser Thr Thr Asp He He Leu Arg Asn Leu 
370 375 380 



WO 03/033688 



PCT/EP02/11473 



■30- 



Arg Pro His Thr Leu Tyr Asn He Ser Val Arg Ser Tyr Thr Arg Phe 
385 390 395 400 



Gly His Gly Asn Gin Val Ser Ser Leu Leu Ser Val Arg Thr Ser Glu 
405 410 415 



Thr Val Pro Asp Ser Ala Pro Glu Asn He Thr Tyr Lys Asn He Ser 
420 425 430 



Ser Gly Glu He Glu Leu Ser Phe Leu Pro Pro Ser Ser Pro Asn Gly 
435 440 445 



He He Gin Lys Tyr Thr He Tyr Leu Lys Arg Ser Asn Gly Asn Glu 
450 ~ 455 460 



Glu Arg Thr He Asn Thr Thr Ser Leu Thr Gin Asn He Lys Val Leu 
465 470 475 480 



Lys Lys Tyr Thr Gin Tyr He He Glu Val Ser Ala Ser Thr Leu Lys 
485 490 495 



Gly Glu Gly Val Arg Ser Ala Pro He Ser He Leu Thr Glu Glu Asp 
500 505 510 



Ala Pro Asp Ser Pro Pro Gin Asp Phe Ser Val Lys Gin Leu Ser Gly 
515 520 525 



Val Thr Val Lys Leu Ser Trp Gin Pro Pro Leu Glu Pro Asn Gly He 
530 ~ 535 540 



He Leu Tyr Tyr Thr Val Tyr Val Trp Asn Arg Ser Ser Leu Lys Thr 
545 550 555 560 



He Asn Val Thr Glu Thr Ser Leu Glu Leu Ser Asp Leu Asp Tyr Asn 
565 570 575 



Val Glu Tyr Ser Ala Tyr Val Thr Ala Ser Thr Arg Phe Gly Asp Gly 
580 585 590 



Lys Thr Arg Ser Asn He He Ser Phe Gin Thr Pro Glu Gly Ala Pro 
595 600 605 



Ser Asp Pro Pro Lys Asp Val Tyr Tyr Ala Asn Leu Ser Ser Ser Ser 
610 615 620 
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Ile II© Leu Phe Trp Thr Pro Pro Ser Lys Pro Asn Gly He He Gin 
625 " 630 635 640 



Tyr Tyr Ser Val Tyr Tyr Arg Asn Thr Ser Gly Thr Phe Met Gin Asn 
645 650 655 



Phe Thr Leu His Glu Val Thr Asn Asp Phe Asp Asn Met Thr Val Ser 
660 665 670 



Thr He He Asp Lys Leu Thr He Phe Ser Tyr Tyr Thr Phe Trp Leu 
675 680 685 



Thr Ala Ser Thr Ser Val Gly Asn Gly Asn Lys Ser Ser Asp He He 
690 695 700 



Glu Val Tyr Thr Asp Gin Asp He Pro Glu Gly Phe Val Gly Asn Leu 
705 710 715 720 



Thr Tyr Glu Ser He Ser Ser Thr Ala He Asn Val Ser Trp Val Pro 
725 730 735 



Pro Ala Gin Pro Asn Gly Leu Val Phe Tyr Tyr Val Ser Leu He Leu 
740 745 750 



Gin Gin Thr Pro Arg His Val Arg Pro Pro Leu Val Thr Tyr Glu Arg 
755 760 765 



Ser He Tyr Phe Asp Asn Leu Glu Lys Tyr Thr Asp Tyr He Leu Lys 
770 775 780 



He Thr Pro Ser Thr Glu Lys Gly Phe Ser Asp Thr Tyr Thr Ala Gin 
785 790 795 800 



Leu Tyr He Lys Thr Glu Glu Asp Val Pro Glu Thr Ser Pro He He 
805 810 815 



Asn Thr Phe Lys Asn Leu Ser Ser Thr Ser Val Leu Leu Ser Trp Asp 
820 825 830 



Pro Pro Val Lys Pro Asn Gly Ala He He Ser Tyr Asp Leu Thr Leu 
835 840 845 



WO 03/033688 



PCT/EP02/11473 



-32- 



Gin Gly Pro Asn Glu Asn Tyr Ser Phe lie Thr Ser Asp Asn Tyr lie 
850 855 860 



lie Leu Glu Glu Leu Ser Pro Phe Thr Leu Tyr Ser Phe Phe Ala Ala 
865 870 875 880 



Ala Arg Thr Arg Lys Gly Leu Gly Pro Ser Ser lie Leu Phe Phe Tyr 
885 890 895 



Thr Asp Glu Ser Val Pro Leu Ala Pro Pro Gin Asn Leu Thr Leu lie 
900 905 910 



Asn Cys Thr Ser Asp Phe Val Trp Leu Lys Trp Ser Pro Ser Pro Leu 
915 920 925 



Pro Gly Gly He Val Lys Val Tyr Ser Phe Lys He His Glu His Glu 
930 935 940 



Thr Asp Thr He Tyr Tyr Lys Asn He Ser Gly Phe Lys Thr Glu Ala 
945 950 955 960 



Lys Leu Val Gly Leu Glu Pro Val Ser Thr Tyr Ser He Arg Val Ser 
965 970 975 



Ala Phe Thr Lys Val Gly Asn Gly Asn Gin Phe Ser Asn Val Val Lys 
980 * ~° 985 990 



Phe Thr Thr Gin Glu Ser Val Pro Asp Val Val Gin Asn Met Gin Cys 
995 1000 1005 



Met Ala Thr Ser Trp Gin Ser Val Leu Val Lys Trp Asp Pro Pro 
1010 1015 1020 



Lys Lys Ala Asn Gly He He Thr Gin Tyr Met Val Thr Val Glu 
1025 ~ 1030 1035 



Arg Asn Ser Thr Lys Val Ser Pro Gin Asp His Met Tyr Thr Phe 
1040 1045 1050 



He Lys Leu Leu Ala Asn Thr Ser Tyr Val Phe Lys Val Arg Ala 
1055 1060 1065 



Ser Thr Ser Ala Gly Glu Gly Asp Glu Ser Thr Cys His Val Ser 
1070 ~ 1075 1080 



WO 03/033688 



-33- 



PCT/EP02/11473 



Thr Leu Pro Glu Thr Val Pro Ser Val Pro Thr Asn He Ala Phe 
1085 1090 1095 



Ser Asp Val Gin Ser Thr Ser Ala Thr Leu Thr Trp He Arg Pro 
1100 1105 1110 



Asp Thr He Leu Gly Tyr Phe Gin Asn Tyr Lys He Thr Thr Gin 
1115 1120 1125 



Leu Arg Ala Gin Lys Cys Lys Glu Trp Glu Ser Glu Glu Cys Val 
1130 ~ 1135 1140 



Glu Tyr Gin Lys He Gin Tyr Leu Tyr Glu Ala His Leu Thr Glu 
1145 1150 1155 



Glu Thr Val Tyr Gly Leu Lys Lys Phe Arg Trp Tyr Arg Phe Gin 
1160 1165 1170 



Val Ala Ala Ser Thr Asn Ala Gly Tyr Gly Asn Ala Ser Asn Trp 
1175 1180 *" 1185 



He Ser Thr Lys Thr Leu Pro Gly Pro Pro Asp Gly Pro Pro Glu 
1190 1195 1200 



Asn Val His Val Val Ala Thr Ser Pro Phe Ser He Ser He Ser 
1205 1210 1215 



Trp Ser Glu Pro Ala Val He Thr Gly Pro Thr Cys Tyr Leu He 
1220 1225 1230 



Asp Val Lys Ser Val Asp Asn Asp Glu Phe Asn He Ser Phe He 
1235 1240 1245 



Lys Ser Asn Glu Glu Asn Lys Thr He Glu He Lys Asp Leu Glu 
1250 1255 1260 



He Phe Thr Arg Tyr Ser Val Val He Thr Ala Phe Thr Gly Asn 
1265 1270 1275 



He Ser Ala Ala Tyr Val Glu Gly Lys Ser Ser Ala Glu Met He 
1280 1285 1290 



Val Thr Thr Leu Glu Ser Ala Pro Lys Asp Pro Pro Asn Asn Met 
1295 1300 1305 
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Thr Phe Gin Lys lie Pro Asp Glu Val Thr Lys Phe Gin Leu Thr 
1310 ^ 1315 1320 



Phe Leu Pro Pro Ser Gin Pro Asn Gly Asn He Gin Val Tyr Gin 
1325 1330 1335 



Ala Leu Val Tyr Arg Glu Asp Asp Pro Thr Ala Val Gin He His 
1340 1345 1350 



Asn Leu Ser He He Gin Lys Thr Asn Thr Phe Val He Ala Met 
1355 1360 1365 



Leu Glu Gly Leu Lys Gly Gly His Thr Tyr Asn He Ser Val Tyr 
1370 ~ 1375 1380 



Ala Val Asn Ser Ala Gly Ala Gly Pro Lys Val Pro Met Arg He 
1385 1390 1395 



Thr Met Asp He Lys Ala Pro Ala Arg Pro Lys Thr Lys Pro Thr 
1400 1405 1410 



Pro He Tyr Asp Ala Thr Gly Lys Leu Leu Val Thr Ser Thr Thr 
1415 1420 1425 



He Thr He Arg Met Pro He Cys Tyr Tyr .Ser Asp Asp His Gly 
1430 1435 1440 



Pro He Lys Asn Val Gin Val Leu Val Thr Glu Thr Gly Ala Gin 
1445 1450 1455 



His Asp Gly Asn Val Thr Lys Trp Tyr Asp Ala Tyr Phe Asn Lys 
1460 1465 1470 



Ala Arg Pro Tyr Phe Thr Asn Glu Gly Phe Pro Asn Pro Pro Cys 
1475 ~ 1480 1485 



Thr Glu Gly Lys Thr Lys Phe Ser Gly Asn Glu Glu He Tyr He 
1490 1495 1500 



He Gly Ala Asp Asn Ala Cys Met He Pro Gly Asn Glu Asp Lys 
1505 1510 1515 
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He Cys Asn Gly Pro Leu Lys Pro Lys Lys Gin Tyr Leu Phe Lys 
1520 1525 ~ 1530 



Phe Arg Ala Thr Asn He Met Gly Gin Phe Thr Asp Ser Asp Tyr 
1535 1540 1545 



Ser Asp Pro Val Lys Thr Leu Gly Glu Gly Leu Ser Glu Arg Thr 
1550 1555 1560 



Val Glu He He Leu Ser Val Thr Leu Cys He Leu Ser He He 
1565 1570 1575 



Leu Leu Gly Thr Ala He Phe Ala Phe Ala Arg He Arg Gin Lys 
1580 ~ 1585 1590 



Gin Lys Glu Gly Gly Thr Tyr Ser Pro Gin Asp Ala Glu He He 
1595 1600 ~ 1605 



Asp Thr Lys Leu Lys Leu Asp Gin Leu He Thr Val Ala Asp Leu 
1610 1615 1620 



Glu Leu Lys Asp Glu Arg Leu Thr Arg Leu Leu Ser Tyr Arg Lys 
1625 1630 1635 



Ser He Lys Pro He Ser Lys Lys Ser Phe Leu Gin His Val Glu 
1640 1645 1650 



Glu Leu Cys Thr Asn Asn Asn Leu Lys Phe Gin Glu Glu Phe Ser 
1655 1660 1665 



Glu Leu Pro Lys Phe Leu Gin Asp Leu Ser Ser Thr Asp Ala Asp 
1670 1675 1680 



Leu Pro Trp Asn Arg Ala Lys Asn Arg Phe Pro Asn He Lys Pro 
1685 1690 1695 



Tyr Asn Asn Asn Arg Val Lys Leu He Ala Asp Ala Ser Val Pro 
1700 1705 1710 



Gly Ser Asp Tyr He Asn Ala Ser Tyr He Ser Gly Tyr Leu Cys 
1715 1720 1725 



Pro Asn Glu Phe He Ala Thr Gin Gly Pro Leu Pro Gly Thr Val 
1730 1735 1740 
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Gly Asp Phe Trp Lys Met Val Trp Glu Thr Arg Ala Lys Thr Leu 
1745 " 1750 ~ 1755 



Val Met Leu Thr Gin Cys Phe Glu Lys Gly Arg lie Arg Cys His 
1760 1765 "** 1770 



Gin Tyr Trp Pro Glu Asp Asn Lys Pro Val Thr Val Phe Gly Asp 
1775 ~ 1780 1785 



lie Val lie Thr Lys Leu Met Glu Asp Val Gin lie Asp Trp Thr 
1790 1795 1800 



lie Arg Asp Leu Lys lie Glu Arg His Gly Asp Cys Met Thr Val 
1805 1810 " 1815 



Arg Gin Cys Asn Phe Thr Ala Trp Pro Glu His Gly Val Pro Glu 
1820 ~ 1825 1830 



Asn Ser Ala Pro Leu lie His Phe Val Lys Leu Val Arg Ala Ser 
1835 1840 1845 



Arg Ala His Asp Thr Thr Pro Met lie Val His Cys Ser Ala Gly 
1850 1855 1860 



Val Gly Arg Thr Gly Val Phe lie Ala Leu Asp His Leu Thr Gin 
1865 " 1870 1875 



His lie Asn Asp His Asp Phe Val Asp lie Tyr Gly Leu Val Ala 
1880 1885 1890 



Glu Leu Arg Ser Glu Arg Met Cys Met Val Gin Asn Leu Ala Gin 
1895 1900 1905 



Tyr lie Phe Leu His Gin Cys lie Leu Asp Leu Leu Ser Asn Lys 
1910 1915 " 1920 



Gly Ser Asn Gin Pro lie Cys Phe Val Asn Tyr Ser Ala Leu Gin 
1925 1930 1935 



Lys Met Asp Ser Leu Asp Ala Met Glu Gly Asp Val Glu Leu Glu 
1940 1945 1950 



WO 03/033688 



PCT/EP02/11473 



-37- 

Trp Glu Glu Thr Thr Met 
1955 

<210> 6 

<211> 990 

<212> DMA 

<213> Homo sapiens 



<400> 6 
atggattttc 


ttatcatttt 


tcttttactt 


tttattggga cttcagagac acaggttgat 


60 


gtttccaatg 


tcgttcctgg 


tactaggtac 


gatataacca tctcttcaat ttctacaaca 


120 


tacacctcac 


ctgtfcactag 


aatagtgaca 


acaaatgtaa caaaaccagg gcctccagtc 


180 


ttcctagccg 


gaaaaacraat 


cggatctgct 


gggattcttc tgtcttggaa tacaccacct 


240 


aatccaaatg 


gaaggattat 


atcttacatt 


gtcaaatata aggaagtttg tccgtggatg 


300 


caaacagtat 


atacacaagt 


cagatcaaag 


ccagacagtc tggaagttct tcttactaat 


360 


cttaatcctg 


gaacaacata 


tgaaattaag 


gttgctgctg aaaacagtgc tggcattgga 


420 


gtgtttagtg 


atccatttct 


cttccaaact 


gcagaaagtg ctccaggaaa agtggtgaat 


480 


ctcacagttg 


aggcctacaa 


cgcttcagca 


gttaagctga tttggtattt acctcggcaa 


540 


ccaaatggca 


aaattaccag 


cttcaagatt 


agtgtcaagc atgccagaag tgggatagta 


600 


gtgaaagatg 


tctcaatcag 


agtagaggac 


attttgactg ggaaattgcc agaaagcaat 


660 


gagaatagtg 


aatctttttt 


atggagtaca 


gccagccctt ctccaaccct tggtagagtt 


720 


acacctccat 


cgcgtaccac 


acattcatca 


agcacgttga cacagaatga gatcagctct 


780 


gtgtggaaag 


agcctatcag 


ttttgtagtg 


acacacttga gaccttatac aacatatctt 


840 


tttgaagttt 


cagctgctac 


aactgaagca 


ggttatattg atagtacgat tgtcagaaca 


900 


ccagaatcag 


ccaccgctgc 


tggtacccag 


gctaacaggg tctggagtag acctctagca 


960 


aactccaaca 


gacctgcagc 


ggagggtcct 




990 



<210> 7 

<211> 5877 

<212> DNA 

<213> Homo sapiens 
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<400> 7 

ctgttaggag gaaaactaac aaacagaaag gacatccaca ccaaaaaccc atctgtacat 60 

caeca tea tc aaagaccaaa agtagataaa accacaaaga tggggaaaaa acagagcaga 120 

aaaactggaa actctaaaaa gcagagcacc tctcctcctc caaaggatgg cagggcagtg 180 

tttgatttac aacttgeaga ggtagaatcc aegcaagtaa gaattacttg gaagaaacca 240 

cgacaaccaa atggaattat taaccaatac cgagtgaaag tgctagttcc agagacagga 300 

ataattttgg aaaatacttt gctcactgga aataatgagt atataaatga ccccatggct 360 

ccagaaattg tgaacatagt agagecaatg gtaggattat atgagggttc agcagagatg 420 

tcgtctgacc ttcactcact tgctacattt atatataaca gccatccaga taaaaacttt 480 

ectgeaagga atagagctga agaccagact tcaccagttg taactacaag gaatcagtat 540 

attactgaca ttgcagctga acagctgtct tatgttatca ggagacttgt acctttcact 600 

gagcacatga ttagtgtatc tgctttcacc atcatgggag aaggaccacc aacagttctc 660 

agtgfctagga cacgtcagca agtgccaagc tccattaaaa ttataaacta taaaaatatt 720 

agttcttcat ctattttgtt atattgggat cctccagaat atcccaatgg aaaaataact 780 

cactatacga tttatgcaat ggaattggat acaaacagag cattccagat aactaccata 840 

gataacagct ttctcataac agggttaaag aaatacacaa aatacaaaat gagagtggca 900 

gcctcaaccc acgttggaga aagttctttg tctgaagaaa atgacatctt tgtgagaact 960 

tcagaagatg aaceggaate atcacctcaa gatgtcgaag taattgatgt tacegcagat 1020 

gaaataaggt tgaagtggtc accacccgaa aagcccaatg ggatcattat tgcttatgaa 1080 

gtgetatata aaaatataga tactttatat atgaagaaca catcaacaac agacataata 1140 

ttaaggaact taagacctca caccctctat aacatttctg taaggtctta caccagattt 1200 

ggtcatggca atcaggtatc ttctttactc tctgtaagga etteggagae tgtgcctgat 1260 

agtgcaccag aaaatatcac ttacaaaaat atttcttctg gagagattga gctatcattc 1320 

cttcccccaa gtagtcccaa tggaatcata caaaaatata caatttatct caagagaagt 1380 

aatggaaatg aggaaagaac tataaataca acctctttaa cccaaaacat taaagtactg 1440 

aagaaatata cccaatatat cattgaggtg tetgetagta cactcaaagg tgaaggagtt 1500 

eggagtgetc ccataagtat actgaeggag gaagatgetc ctgattctcc ccctcaagac 1560 

ttctctgtaa aacagttgtc tggtgtcacg gtgaagttgt catggcaacc acccctggag 1620 

ccaaatggaa ttatccttta ttacacagtt tatgtctgga atagatcatc attaaaaact 1680 
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attaatgtca ctgaaacatc attggagtta tcagatttgg attataatgt tgaatacagt 1740 

gcttatgtaa cagctagcac cagatttggt gatgggaaaa caagaagcaa tatcattagc 1800 

tttcaaacac cagagggagc accaagcgat cctcccaaag atgtttatta tgcaaacctc 1860 

agttcttcat caataattct tttctggaca cctccttcaa aacctaatgg gattatacaa 1920 

tattactctg tttattacag aaatacttca ggtactttta tgcagaattt tacactccat 1980 

gaagtaacca atgactttga caatatgact gtatccacaa ttatagataa actgacaata 2040 

tt cage tact atacattttg gttaacagca agtacttcag ttggaaatgg gaataaaagc 2100 

agtgacatca ttgaagtata cacagatcaa gacatacctg aagggtttgt tggaaacctg 2160 

acttacgaat ccatttcgtc aactgeaata aatgtaagct gggtcccacc ggctcaacca 2220 

aacggtctag tcttctacta tgtttcactg atcttacagc agactcctcg ccatgtgaga 2280 

ccacctcttg ttacatatga gagaagcata tattttgata atctggaaaa atacactgat 2340 

tatatattaa aaattactcc atcaacagaa aagggattct ctgataccta tactgcccag 2400 

ctatacatca agactgaaga agatgtccca gaaacttcac caataatcaa cacttttaaa 2460 

aacctttcct ctacctcagt tctcttatca tgggatcccc cagtaaagee aaatggtgca 2520 

ataataagtt atgatttaac tttacaagga ccaaatgaaa attattcttt cattacttct 2580 

gataattaca taatattgga agagctttca ccatttacat tatatagctt ttttgetgee 2640 

gcaagaacta gaaaaggact tggtccttcc agtattcttt tcttttacac agatgagtca 2700 

gtgccgttag cacctccaca aaatttgact ttaatcaact gtacttcaga ctttgtatgg 2760 

ctgaaatgga gcccaagtcc tcttccaggt ggtattgtta aagtatatag ttttaaaatt 2820 

catgaacatg aaactgacac tatatattat aagaatatat caggatttaa aactgaagee 2880 

aaacttgttg gactggaacc agtcagcacc tactctatcc gtgtatctgc gttcaccaaa 2940 

gttggaaatg gcaatcaatt tagtaatgta gtaaaattca caacccaaga atcagttcca 3000 

gatgtcgtgc agaatatgea gtgcatggca actagctggc agtcagtttt agtgaaatgg 3060 

gatccaccca aaaaggcaaa tggaataata aegcagtata tggtaacagt tgaaaggaat 3120 

tctacaaaag tttctcccca agatcacatg tacactttca taaagcttct tgccaatacc 3180 

tcatatgtct ttaaagtaag agcttcaacc tcagctggtg aaggtgatga aagcacatgc 3240 

catgtcagca cactacctga aacagttccc agtgttccca caaatattgc tttttctgat 3300 

gttcagtcaa etagtgeaac attgacatgg ataagacctg acactatcct tggctacttt 3360 

caaaattaca aaattaccac teaacttegt gctcaaaaat gcaaagaatg ggaatccgaa 3420 

gaatgtgttg aatatcaaaa aattcaatac ctctatgaag ctcacttaac tgaagagaca 3480 
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gtatatggat taaagaaatt tagatggtat agattccaag tggctgccag caccaatgct 3540 

ggctatggca atgcttcaaa ctggatttct acaaaaactc tgcctggccc tccagatggt 3600 

cctcctgaaa atgttcatgt agtagcaaca tcacctttta gcatcagcat aagctggagt 3660 

gaacctgctg tcattactgg accaacatgt tatctgattg atgtcaaatc ggtagataat 3720 

gatgaattta atatatcctt catcaagtca aatgaagaaa ataaaaccat agaaattaaa 3780 

gatttagaaa tattcacaag gtattctgta gtgatcactg catttactgg gaacattagt 3840 

gctgcatatg tagaagggaa gtcaagtgct gaaatgattg ttactacttt agaatcagcc 3900 

ccaaaggacc cacctaacaa catgacattt cagaagatac cagatgaagt tacaaaattt 3960 

caattaacgt tccttcctcc ttctcaacct aatggaaata tccaagtata tcaagctctg 4020 

gtttaccgag aagatgatcc tactgctgtc cagattcaca acctcagtat tatacagaaa 4080 

accaacacat tcgtcattgc aatgctagaa ggactaaaag gtggacatac atacaatatc 4140 

agtgtttacg cagtcaatag tgctggtgca ggtccaaagg ttccgatgag aataaccatg 4200 

gatatcaaag ctccagcacg accaaaaacc aaaccaaccc ctatttatga tgccacagga 4260 

aaactgcttg tgacttcaac aacaattaca atcagaatgc caatatgtta ctaoagtgat 4320 

gatcatggac caataaaaaa tgtacaagtg cttgtgacag aaacaggagc tcagcatgat 4380 

ggaaatgtaa caaagtggta tgatgcatat tttaataaag caaggccata ttttacaaat 4440 

gaaggctttc ctaaccctcc atgtacagaa ggaaagacaa agtttagtgg caatgaagaa 4500 

atctacatca taggtgctga taatgcatgc atgattcctg gcaatgaaga caaaatttgc 4560 

aatggaccac tgaaaccaaa aaagcaatac ttatttaaat ttagagctac aaatattatg 4620 

ggacaattta ctgactctga ttattctgac cctgttaaga ctttagggga aggactttca 4680 

gaaagaaccg tagagatcat tctttccgtc actttgtgta tcctttcaat aattctcctt 4740 

ggaacagcta tttttgcatt tgcaagaatt cgacagaagc agaaagaagg tggcacatac 4800 

tctcctcagg atgcagaaat tattgacact aaattgaagc tggatcagct catcacagtg 4860 

gcagacctgg aactgaagga cgagagatta acgcggttac ttagttatag aaaatccatc 4920 

aagccaataa gcaagaaatc cttcctgcaa catgttgaag agctttgcac aaacaacaac 4980 

ctaaagtttc aagaagaatt ttcggaatta ccaaaatttc ttcaggatct ttcttcaact 5040 

gatgctgatc tgccttggaa tagagcaaaa aaccgcttcc caaacataaa accatataat 5100 

aataacagag taaagctgat agctgacgct agtgttccag gttcggatta tattaatgcc 5160 

agctatattt ctggttattt atgtccaaat gaatttattg ctactcaagg tccactacca 5220 
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ggaacagfctg gagatttttg gaaaatggtg tgggaaacca gagcaaaaac attagtaatg 5280 

ctaacacagt gttttgaaaa aggacggatc agatgccatc agtattggcc agaggacaac 5340 

aagccagtta ctgtctttgg agatatagtg attacaaagc taatggagga tgttcaaata 5400 

gattggacta tcagggatct gaaaattgaa aggcatgggg attgcatgac tgttcgacag 5460 

tgtaacttta ctgcctggcc agagcatggg gttcctgaga acagcgcccc tctaattcac 5520 

tttgtgaagt tggttcgagc aagcagggca catgacacca cacctatgat tgttcactgc 5580 

agtgctggag ttggaagaac tggagttttt attgctctgg accatttaac acaacatata 5640 

aatgaccatg attttgtgga tatatatgga ctagtagctg aactgagaag tgaaagaatg 5700 

tgcatggtgc agaatctggc acagtatatc tttttacacc agtgcattct ggatctctta 5760 

tcaaataagg gaagtaatca gcccatctgt tttgttaact attcagcact tcagaagatg 5820 

gactctttgg acgccatgga aggtgatgtt gagcttgaat gggaagaaac cactatg 5877 

<210> 8 

<211> 412 

<212> DNA 

<213> Homo sapiens 

<400> 8 

tttactgctfc aaaatctagg tgggtgtgag gccatgactg tgaacgttgt cagtcactga 60 

ctgtattcag ctgtaaaggc ttgacaaggt gagaaaaacc tcccacactt ttaaaatgaa 120 

tgaaatgttt gaacactaag tagtcataca caaaataatt accttaattt catatgttgt 180 

tccaggatta agattagtaa gaagaaottc cagactgtct ggctttgatc tgacttgtgt 240 

atatactgtt tgcatccacg gacaaacttc cttatatttg acaatgtaag atataatcct 300 

tccatttgga ttaggtggtg tattccaaga cagaagaatc ccagcagatc cgactctttc 360 

cccggctagg aagactggag gccctggttc tatttttaaa aaatccattt tg 412 
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